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About This Guide 



This guide provides component information, step-by-step instructions for installation, 
reference information for operation, troubleshooting, and migration and upgrade 
information for the Compaq ProLiant Clusters HA/F100 and HA/F200 for MSA 1000. 

Intended Audience 

This guide is intended for persons with administrator-level experience. This guide 
assumes you are familiar with basic cluster hardware and software configurations, as 
well as cluster management and operation. 

A Important Safety Information 

Before installing this product, read the Important Safety Information document 
provided. 

Symbols on Equipment 

The following symbols may be placed on equipment to indicate the presence of 
potentially hazardous conditions: 



^\ WARNING: This symbol, in conjunction with any of the following symbols, 
/ I \ indicates the presence of a potential hazard. The potential for injury exists if 
warnings are not observed. Consult your documentation for specific details. 
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■A This symbol indicates the presence of hazardous energy circuits or electric 
// \ shock hazards. Refer all servicing to qualified personnel. 

WARNING: To reduce the risk of injury from electric shock hazards, do not 
open this enclosure. Refer all maintenance, upgrades, and servicing to 
qualified personnel. 



<8> 



This symbol indicates the presence of electric shock hazards. The area 
contains no user or field serviceable parts. Do not open for any reason. 

WARNING: To reduce the risk of injury from electric shock hazards, do not 
open this enclosure. 



This symbol on an RJ-45 receptacle indicates a network interface connection. 

WARNING: To reduce the risk of electric shock, fire, or damage to the 
equipment, do not plug telephone or telecommunications connectors into this 
receptacle. 




This symbol indicates the presence of a hot surface or hot component. If this 
surface is contacted, the potential for injury exists. 



WARNING: To reduce the risk of injury from a hot component, allow the 
surface to cool before touching. 




These symbols, on power supplies or systems, indicate that the 
equipment is supplied by multiple sources of power. 



WARNING: To reduce the risk of injury from electric shock, 
remove all power cords to completely disconnect power from the 
system. 



viii 



Compaq ProLiant Clusters HA/F100 and HA/F200 for MSA1000 Administrator Guide 



About This Guide 



Rack Stability 

WARNING: To reduce the risk of personal injury or damage to the equipment, 
be sure that: 

• The leveling jacks are extended to the floor. 

• The full weight of the rack rests on the leveling jacks. 

• The stabilizing feet are attached to the rack if it is a single-rack installation. 

• The racks are coupled together in multiple-rack installations. 

• Only one component is extended at a time. A rack may become unstable if 
more than one component is extended for any reason. 



Symbols in Text 

These symbols may be found in the text of this guide. They have the following 
meanings. 



WARNING: Text set off in this manner indicates that failure to follow directions 
in the warning could result in bodily harm or loss of life. 



A CAUTION: Text set off in this manner indicates that failure to follow directions could 
result in damage to equipment or loss of information. 



IMPORTANT: Text set off in this manner presents clarifying information or specific 
instructions. 

NOTE: Text set off in this manner presents commentary, sidelights, or interesting points of 
information. 
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Text Conventions 

This document uses the following conventions: 

• Italic type is used for complete titles of published guides or variables. Variables 
include information that varies in system output, in command lines, and in 
command parameters in text. 

• Bold type is used for emphasis, for onscreen interface components (window 
titles, menu names and selections, button and icon names, and so on), and for 
keyboard keys. 

• Monospace typeface is used for command lines, code examples, screen 
displays, error messages, and user input. 

• Sans serif typeface is used for uniform resource locators (URLs). 

Getting Help 

If you have a problem and have exhausted the information in this guide, you can get 
further information and other help in the following locations. 

Compaq Technical Support 

In North America, call the Compaq Technical Support Phone Center at 
1-800-OK-COMPAQ. This service is available 24 hours a day, 7 days a week. For 
continuous quality improvement, calls may be recorded or monitored. Outside North 
America, call the nearest Compaq Technical Support Phone Center. Telephone 
numbers for worldwide Technical Support Centers are listed on the Compaq website, 
www.compaq.com. 

Be sure to have the following information available before you call Compaq: 

• Technical support registration number (if applicable) 

• Product serial number 

• Product model name and number 

• Applicable error messages 
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• Add-on boards or hardware 

• Third-party hardware or software 

• Operating system type and revision level 

Compaq Website 

The Compaq website has information on this product as well as the latest drivers and 
Flash ROM images. You can access the Compaq website at www.compaq.com. 

Compaq Authorized Reseller 

For the name of your nearest Compaq authorized reseller: 

• In the United States, call 1-800-345-1518. 

• In Canada, call 1-800-263-5868. 

• Elsewhere, see the Compaq website for locations and telephone numbers. 

Reader's Comments 

Compaq welcomes your comments on this guide. Please send your comments and 
suggestions by email to ServerDocumentation@compaq. com. 
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Architecture of the Compaq ProLiant 
Clusters HA/F100 and HA/F200 for 

MSA1000 



Overview of Components 

A cluster is a loosely coupled collection of servers and storage that acts as a single 
system, presents a single-system image to clients, provides protection against system 
failures, and provides configuration options for static load balancing. 

Clustering is an established technology that can provide one or more of the following 
benefits: 

• Availability 

• Scalability 

• Manageability 

• Investment protection 

• Operational efficiency 

This chapter discusses the role each of these products plays in bringing a complete 
clustering solution to your computing environment. 
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Compaq ProLiant Cluster HA/F100 for MSA1000 

The Compaq ProLiant™ Cluster HA/F100 for MSA1000 (HA/F100 for MSA1000) 
includes these hardware solution components: 

• Two Compaq ProLiant servers 

• One or more Compaq StorageWorks™ Modular SAN Array 1000 storage 
systems (MSA 1000) 

• One Compaq StorageWorks Modular SAN Array 1000 Controller per MSA 1000 
storage system (MSA 1000 Controller) 

• One of the following hubs or switches: 

— Compaq StorageWorks Fibre Channel Storage Hub 7 (7 -port FC-AL hub) 

— Compaq StorageWorks Fibre Channel SAN Switch 8 (SAN Switch/8) 

— Compaq StorageWorks Fibre Channel SAN Switch 16 (SAN Switch/16) 

— Compaq StorageWorks Fibre Channel SAN Switch 8-EL (SAN Switch/8EL) 

— Compaq StorageWorks Fibre Channel SAN Switch 16-EL 
(SAN Switch/1 6EL) 

— Compaq StorageWorks Modular SAN Array 1000 Fabric Switch 6 
(MSA Fabric Switch 6) 

• One Compaq StorageWorks FCA-2101 host bus adapter per server 
(FCA-2101 HBA) 

• Network interface cards (NICs) 

• Gigabit Interface Converter-Shortwave (GBIC-SW) modules 

• Cables: 

— Multimode Fibre Channel cable 

— Ethernet crossover cable (included in the cluster kit) 

— CAT 3, 4, or 5 UTP (2 or 4 pair) cable 
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The HA/F100 for MSA 1000 uses these software solution components: 

• One of the following Microsoft Windows operating systems: 

— Microsoft Windows NT Server 4.0, Enterprise Edition (Windows Enterprise 
Edition) with Microsoft Cluster Server (MSCS) 

— Microsoft Windows 2000 Advanced Server with Microsoft Cluster Service 
(MSCS) 

• Compaq SmartStart™ and Support Software CD (included in the Server Setup 
and Management Pack) 

• Compaq Modular SAN Array 1000 Support Software CD 

• Compaq Management tools 

— Compaq Insight Manager™ (optional) 

— Compaq Insight Manager XE (optional) 

— Compaq Insight Manager 7 (optional) 

• Compaq Intelligent Cluster Administrator (optional) 

NOTE: See Appendix A for the necessary software version levels for your cluster. 
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The following illustration depicts the HA/F100 for MSA 1000 configuration. 



Figure 1-1 : Hardware components of the Compaq 
ProLiant Cluster HA/F1 00 for MSA1000 



Compaq ProLiant Cluster HA/F200 for MSA1000 

The Compaq ProLiant Cluster HA/F200 for MSA1000 (HA/F200 for MSA1000) 
adds Compaq SANworks™ Secure Path for Windows Workgroup Edition software 
and a redundant Fibre Channel path to the HA/F100 for MSA 1000 configuration. The 
redundancy manager software, in conjunction with redundant Fibre Channel paths, 
enhances the high availability features of the HA/F200 for MSA 1000. 

The HA/F200 for MSA 1000 includes these hardware solution components: 

• Two Compaq ProLiant servers 

• One or more Compaq Storage Works Modular SAN Array 1000 storage systems 
(MSA 1000) 

• Two Compaq Storage Works Modular SAN Array 1000 Controllers per 
MSA 1000 storage system (MSA 1000 Controller) 
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• Two of the following hubs or switches: 

— Compaq Storage Works Fibre Channel Storage Hub 7 (7 -port FC-AL hub) 

— Compaq StorageWorks Fibre Channel SAN Switch 8 (SAN Switch/8) 

— Compaq StorageWorks Fibre Channel SAN Switch 16 (SAN Switch/16) 

— Compaq StorageWorks Fibre Channel SAN Switch 8-EL (SAN Switch/8EL) 

— Compaq StorageWorks Fibre Channel SAN Switch 16-EL 
(SAN Switch/1 6EL) 

— Compaq StorageWorks Modular SAN Array 1000 Fabric Switch 6 
(MSA Fabric Switch 6) 

• Two Compaq StorageWorks FCA-2101 host bus adapters per server 
(FCA-2101 HBA) 

• Network Interface cards (NICs) 

• Gigabit Interface Converter-Shortwave (GBIC-SW) modules 

• Cables: 

— Multimode Fibre Channel cable 

— Ethernet crossover cable (included in the cluster kit) 

— CAT 3, 4, or 5 UTP (2 or 4 pair) cable 

The HA/F200 for MSA 1000 includes these software solution components: 

• One of the following Microsoft Windows operating systems: 

— Microsoft Windows NT Server 4.0, Enterprise Edition (Windows Enterprise 
Edition) with Microsoft Cluster Server (MSCS) 

— Microsoft Windows 2000 Advanced Server with Microsoft Cluster Service 
MSCS) 

• Compaq SmartStart and Support Software CD (included in the Server Setup and 
Management Pack) 

• Compaq Modular SAN Array 1000 Support Software CD 

• Compaq SAN works Secure Path Version 3. IB for Windows Workgroup Edition 
(included in the cluster kit) 
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• Compaq Management tools 

— Compaq Insight Manager (optional) 

— Compaq Insight Manager XE (optional) 

— Compaq Insight Manager 7 (optional) 

• Compaq Intelligent Cluster Administrator (optional) (included in the HA/F200 
cluster kit) 

NOTE: See Appendix A, "Software and Firmware Versions," for the necessary software 
version levels for your cluster. 

The following illustration depicts the basic HA/F200 for MSA 1000 configuration. 




LAN 



Figure 1-2: Hardware components of the Compaq 
ProLiant Cluster HA/F200 for MSA1000 
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Compaq ProLiant Servers 

Compaq industry-standard servers are a primary component of all models of Compaq 
ProLiant Clusters. At the high end of the ProLiant server line, several high 
availability and manageability features are incorporated as a standard part of the 
server feature set. These include: 

• Online backup processors 

• PCI bus with hot-plug capabilities 

• Redundant hot-pluggable fans 

• Redundant processor power modules 

• Redundant Network Interface Controller (NIC) support 

• Dual-ported hot-pluggable 10/100 NICs 

• Redundant hot-pluggable power supplies (on most high-end models) 

Many of these features are available at the low end and mid range of the Compaq 
ProLiant server line as well. 

Compaq has logged thousands of hours testing multiple models of Compaq servers in 
clustered configurations and has successfully passed the Microsoft Hardware Cluster 
Certification Test Suite on numerous occasions. In fact, Compaq was the first vendor 
to be certified using a shared storage subsystem connected to ProLiant servers 
through Fibre Channel Arbitrated Loop technology. 

The Microsoft Cluster Software is based on a cluster architecture known as shared 
storage clustering, in which clustered servers share access to a common set of hard 
drives. Microsoft Cluster Software requires all clustered (shared) data to be stored in 
an external storage system. 

The Compaq Storage Works Modular SAN Array 1000 storage system is the shared 
storage system for the Compaq ProLiant Clusters HA/F100 and HA/F200 for 
MSA 1000. 
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Compaq StorageWorks Modular SAN Array 1000 

The MSA1000 is the storage cabinet that contains the disk drives, power supplies, 
and array controllers. The MSA1000 supports: 

• The same hot-pluggable drives as Compaq servers and Compaq ProLiant storage 
systems 

• Online capacity expansion 

• Online spares 

• RAID fault tolerance of SMART-2 Array Controller technology 

• Hot-pluggable, redundant power supplies and fans 

• Hot-pluggable hard drives 

The HA/F100 for MSA1000 and HA/F200 for MSA1000 cluster systems must have 
at least one MSA 1000 set up as an external shared storage. Consult the Order and 
Configuration Guide for Compaq ProLiant Cluster HA/F100 and HA/F200 at the 
Compaq ProLiant Clusters High Availability website 

(www.compaq.com/highavailability) to determine the maximum supported cluster 
configuration. 

For more detailed information on the MSA 1000, refer to the Compaq StorageWorks 
Modular SAN Array 1000 User Guide. 

Compaq StorageWorks Modular SAN Array 1000 Controller 

MSA 1000 Controller is fully RAID capable and manages all of the drives in the 
MSA 1000 storage array. Each MSA 1000 is shipped with one controller installed. In 
an HA/F100 for MSA 1000 cluster, each array controller is connected to both servers 
through a single Fibre Channel storage hub or Fabric switch. In an HA/F200 for 
MSA 1000 cluster, the addition of a second Compaq StorageWorks Modular SAN 
Array 1000 Redundant Controller is required to provide redundancy. 
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These redundant controllers are connected to each server through two separate, and 
redundant, Fibre Channel storage hubs or Fabric switches. This dual-connection 
configuration implements a vital aspect of the enhanced high availability features of 
the HA/F200 for MSA 1000 cluster. Each of these components is discussed in the 
following sections. For more information, refer to the MSA1000 Controller 
documentation. 

For more information about shared storage clustering, refer to the Microsoft 
clustering documentation. 

Connection Infrastructure for the MSA1000 

The servers in a Compaq ProLiant Cluster HA/F100 and HA/F200 for MSA1000 are 
connected to one or more MSA 1000 shared external storage systems using industry- 
standard Fibre Channel (FC) technology. The components used to implement the FC 
include shortwave (multimode) fiber-optic cables, Gigabit Interface Converters- 
Shortwave (GBIC-SW), and Fibre Channel storage hubs or Fabric switches. 

Compaq StorageWorks Fibre Channel Storage Hub 7 

The Compaq StorageWorks Fibre Channel Storage Hub 7 allows up to five 
MSA 1000 storage systems to be connected to the cluster servers in a "star" topology. 
For the HA/F100 cluster, a single hub is used. For the HA/F200 cluster, two 
redundant Fibre Channel storage hubs are used. 

For more information, refer to the Compaq StorageWorks Fibre Channel Storage 
Hub 7 Installation Guide. 

Compaq StorageWorks SAN Switches 

A Compaq StorageWorks SAN Switch is the core component of an affordable 
storage area network (SAN) solution that will consolidate storage, simplify storage 
management, manage explosive data growth, and reduce business downtime. The 
SAN Switch is a high-performance, switch engine, interconnect component that will 
help you take that important step toward building a low-cost SAN. Built on the 
stable, easy-to-use, and mature FC protocol, the SAN Switch offers numerous ports 
with dedicated nonblocking 100-MB/s point-to-point parallel connections. 
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Using the SAN Switch as the cornerstone of your SAN deployment, you can start by 
combining your primary storage components, such as the MSA1000 storage systems, 
and secondary storage Enterprise Backup Solution components, such as tape libraries 
(TL890/TL891/TL895), on the same departmental SAN. With a 12-Gbps switch 
engine, the SAN Switch delivers the necessary resiliency and speed to isolate your 
client-server network from heavier storage network traffic. 

The SAN Switch can be easily managed using tools such as StorageWorks Command 
Console (SWCC), Compaq Insight Manager 7, Compaq Insight Manager XE, Array 
Configuration Utility (ACU), and the StorageWorks Switch Management Utility. 

For more information, refer to the appropriate SAN Switch guide. 

Compaq StorageWorks FCA-2101 Host Bus Adapter 

Compaq StorageWorks FCA-2101 host bus adapter is the interface between the 
servers and the MSA 1000 storage system. At least two host bus adapters, one for 
each cluster node, are required in the Compaq ProLiant Cluster HA/F100 for 
MSA1000. At least four host bus adapters, two for each cluster node, are required in 
the HA/F200 for MSA 1 000 cluster configuration. 

For more information, refer to the HBA documentation. 

Gigabit Interface Converter-Shortwave 

Two Gigabit Interface Converter-Shortwave (GBIC-SW) modules are required for 
each Fibre Channel cable installed. Two GBIC-SW modules are provided with each 
MSA 1 000, MS A 1 000 Controller, and host bus adapter. 

GBIC-SW modules hot-plug into Fibre Channel storage hubs, array controllers, and 
host bus adapters. These converters provide ease of expansion and 100-MB/s 
performance. GBIC-SW modules support distances up to 500 meters using 
multimode fibre-optic cable. 

For more information, refer to the GBIC-SW documentation. 
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Cluster Interconnect 

The cluster interconnect is a data path over which nodes of a cluster communicate. 
This type of communication is termed intracluster communication. At a minimum, 
the interconnect consists of two network adapters (one in each server) and a cable 
connecting the adapters. 

The cluster nodes use the interconnect data path to: 

• Communicate individual resource and overall cluster status. 

• Send and receive heartbeat signals. 

• Update modified registry information. 

IMPORTANT: MSCS requires TCP/IP as the cluster communication protocol. When 
configuring the interconnects, be sure to enable TCP/IP. 

Client Network 

Every client/server application requires a local area network, or LAN, over which 
client machines and servers communicate. The components of the LAN are no 
different than those of a stand-alone server configuration. 

Because clients who want the full advantage of the cluster will now connect to the 
cluster rather than to a specific server, you will need to configure client connections 
differently from stand-alone server connections. Clients will connect to virtual 
servers, which are cluster groups that contain their own IP addresses. 

Within this guide, communication between the network clients and the cluster is 
termed cluster-to-LAN communication. 
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Private or Public Interconnect 

There are two types of interconnect paths: 

• A private interconnect (also known as a dedicated interconnect) is used solely for 
intracluster (node-to-node) communication. Communication to and from network 
clients does not occur over this type of interconnect. 

• A public interconnect not only takes care of communication between the cluster 
nodes, it also shares the data path with communication between the cluster and its 
network clients. 

Interconnect Adapters 

Ethernet adapters can be used for the interconnect between the servers in a Compaq 
ProLiant Cluster. Either 10-Mb/s or 100-Mb/s Ethernet may be used. 

Ethernet adapters can be connected together using an Ethernet crossover cable or a 
private Ethernet hub. Both of these options provide a dedicated interconnect. 

Implementing a direct Ethernet connection minimizes potential single points of 
failure. 

Redundant Interconnects 

To reduce potential disruptions of intracluster communication, you need to use a 
redundant path over which communication can continue if the primary path is 
disrupted. 

Compaq recommends configuring the client LAN as a backup path for intracluster 
communication. This provides a secondary path for the cluster heartbeat in case the 
dedicated primary path for intracluster communications fails. The redundant path is 
configured when installing the cluster software, or it can be added later using the 
MSCS Cluster Administrator. 
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You also need to provide a redundant path to the client LAN by using a second NIC 
as a hot standby for the primary client LAN NIC. 

There are two methods for providing a redundant path, and the method you choose 
depends on your hardware. You can use the Redundant NIC Utility available on all 
Compaq 10/100 Fast Ethernet products, or you can use the Network Fault Tolerance 
feature designed to operate with the Compaq 10/100 Intel silicon-based NICs. These 
features allow two NICs to be configured so that one is a hot backup for the other. 

Cables 

Three general categories of cables are used for Compaq ProLiant HA/F100 and 
HA/F200 for MSA1000 clusters: 

• Server to storage 

• Cluster interconnect 

• Network interconnect 

Server to Storage 

Shortwave (multimode) fiber-optic cables are used to connect the servers, Fibre 
Channel storage hubs and Fabric switches, and MSA 1000 storage systems in an FC 
configuration. 

Cluster Interconnect 

Ethernet NICs are used to implement the interconnect. Two types of cluster 
interconnect cables may be used depending on the type of devices used to implement 
the interconnect, and whether the interconnect is dedicated or shared: 

• Dedicated interconnect using an Ethernet crossover cable — An Ethernet 
crossover cable (supplied in both the HA/F100 and HA/F200 for MSA1000 kits) 
can be used to connect the NICs directly together to create a dedicated 
interconnect. 
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• Dedicated interconnect using standard Ethernet cables and a private Ethernet 
hub — Standard Ethernet cables can be used to connect the NICs together through 
a private Ethernet hub. Note that an Ethernet crossover cable should not be used 
when using an Ethernet hub because the hub performs the crossover function. 

• Shared interconnect using standard Ethernet cables and a public hub — Standard 
Ethernet cables may also be used to connect the NICs to a public network to 
create a nondedicated interconnect. 

Network Interconnect 

Standard Ethernet cables are used to provide this type of connection. 

Microsoft Software 

Microsoft Windows NT Server 4.0/Enterprise Edition (Windows Enterprise Edition) 
and Microsoft Windows 2000 Advanced Server are the operating systems for the 
Compaq ProLiant Clusters HA/F100 and HA/F200 for MSA1000. The Microsoft 
clustering software, Cluster Server for Windows Enterprise Edition and Microsoft 
Cluster Service for Windows 2000 Advanced Server (MSCS), provides the 
underlying technology to: 

• Send and receive heartbeat signals between the cluster nodes. 

• Monitor the state of each cluster node. 

• Initiate failover and fallback events. 

Microsoft Cluster Administrator, another component of Windows Enterprise Edition 
and Windows 2000 Advanced Server, allows you to: 

• Define and modify cluster groups. 

• Manually control the cluster. 

• View the current state of the cluster. 

NOTE: Microsoft Windows Enterprise Edition or Microsoft Windows 2000 Advanced Server 
must be purchased separately, through your Microsoft reseller. 
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Compaq Software 

Compaq offers an extensive set of features and optional tools to support the 
configuration and management of your Compaq ProLiant Cluster: 

• Compaq SmartStart and Support Software CD 

• Compaq Modular SAN Array 1000 Support Software CD 

• Compaq SAN works Secure Path Version 3. IB for Windows Workgroup Edition 

• Compaq Insight Manager 7 

• Compaq Intelligent Cluster Administrator 

Compaq SmartStart and Support Software CD 

Compaq SmartStart is located on the SmartStart and Support Software CD included 
in the Compaq Server Setup and Management pack shipped with ProLiant servers. 
SmartStart is the recommended way to configure the HA/F100 or HA/F200 for 
MSA 1000 cluster. SmartStart uses a step-by-step process to configure the cluster and 
load the system software. For information concerning SmartStart, refer to the 
Compaq Server Setup and Management pack. 

For information about using SmartStart to install the HA/F100 or HA/F200 for 
MSA 1000 cluster, see chapters 3, 4, and 5 of this guide. 

Compaq Array Configuration Utility 

The Compaq Array Configuration Utility, found on the Compaq SmartStart and 
Support Software CD, is used to configure the array controller, add disk drives to an 
existing configuration, and expand capacity. 
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Compaq System Configuration Utility 

The SmartStart and Support Software CD also contains the Compaq System 
Configuration Utility. This utility is the primary means to configure hardware devices 
in your server, such as I/O addresses, boot order of disk controllers, and so on. 

For information concerning the Compaq System Configuration Utility, refer to the 
Compaq Server Setup and Management pack. 

Compaq Support Paq for Microsoft Windows NT 4.0 and 
Microsoft Windows 2000 

The Compaq Support Paq for Microsoft Windows NT 4.0 and Microsoft Windows 
2000 contains device drivers and utilities that enable you to take advantage of 
specific capabilities offered on Compaq products. These drivers are provided for use 
with Compaq hardware only. The Compaq Support Paq also includes an installer that 
analyzes system requirements and installs all drivers. 

The Compaq Support Paq can be installed from the Compaq SmartStart and Support 
Software CD or downloaded from the Compaq website: 

www.compaq.com/support 

Compaq Modular SAN Array 1000 Support Software CD 

The Compaq Modular SAN Array 1000 Support Software CD contains drivers and 
utilities required for the MSA 1000 storage system. The CD contains such items as 
the Compaq Fibre Channel Host Bus Adapter drivers, which are required for the 
FCA-2101 host bus adapters to interface with the MSA 1000 storage system. The CD 
also contains the ACU, which is a disk configuration utility that allows the user to 
view, set up, and configure Compaq array controllers and storage systems. 

For more information on the Compaq Modular SAN Array 1000 Support Software 
CD, refer to the Compaq Modular SAN Array 1000 Setup and Management Kit. 
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Compaq SANworks Secure Path Version 3.1 B for Windows 
Workgroup Edition 

Compaq SANworks Secure Path is a software component that works in conjunction 
with the Windows 2000 Advanced Server operating system and the Windows NT file 
system (NTFS). Secure Path increases the availability of both single-server and 
clustered systems that use the Compaq StorageWorks Modular SAN Array storage 
system and Compaq ProLiant servers. Secure Path can detect failures in the host bus 
adapter, array controller, or other FC components. 

When such a failure occurs, I/O processing is rerouted through a redundant path, 
allowing applications to continue processing. This rerouting is transparent to 
Windows Enterprise Edition and Windows 2000 Advanced Server. Therefore, in an 
HA/F200 for MSA 1000 cluster configuration, it is not necessary for MSCS to fail 
resources over to the other node. Secure Path, in combination with redundant 
hardware components, is the basis for the enhanced high availability features of the 
HA/F200 for MSA 1000 cluster running Windows Enterprise Edition and Windows 
2000 Advanced Server. 

Two licenses of Secure Path are included in your Compaq ProLiant Cluster HA/F200 
for MSA 1000 Cluster Kit. Secure Path is licensed on a per-server basis and can be 
purchased separately or in the cluster kit. 

For more information about installing Secure Path in an HA/F200 for MSA 1000 
cluster, see Chapter 3 of this guide. For detailed information about the Secure Path 
software, refer to the Compaq SANworks Secure Path Version 3. IB for Windows 
Workgroup Edition Installation and Reference Guide. 
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Compaq Insight Manager 7 

Compaq Insight Manager 7 is a Web-based management system and is located on the 
Compaq Management CD shipped with each ProLiant server. It can be used in 
conjunction with Compaq Insight Manager agents as well as its own Web-enabled 
agents. This browser-based utility provides increased flexibility and efficiency for the 
administrator. It extends the functionality of Compaq Insight Manager and works in 
conjunction with the Cluster Monitor subsystem, providing a common data repository 
and control point for enterprise servers and clusters, desktops, and other devices, 
using either SNMP- or DMI-based messaging. 

Compaq Insight Manager 7 can perform functions such as: 

• Monitoring fault conditions and system status 

• Monitoring shared storage and interconnect adapters 

• Forwarding server alert fault conditions 

• Remotely controlling servers 

The HA/F100 and HA/F200 for MSA1000 clusters also support Compaq 
Insight Manager and Compaq Insight Manager XE. 

For more information concerning Compaq Insight Manager 7, refer to the Compaq 
Server Setup and Management pack shipped with each ProLiant server or access the 
Compaq website: 

www.compaq.com/manage 

Compaq Intelligent Cluster Administrator 

Compaq Intelligent Cluster Administrator (CICA) extends Compaq Insight Manager 
and Cluster Monitor by enabling Intelligent Cluster Administrator to configure and 
manage ProLiant clusters from a Web browser. With CICA, you can copy, modify, 
and dynamically install a cluster configuration on the same physical cluster or on any 
physical cluster anywhere in the system, through the Web. 
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CICA checks for any cluster destabilizing conditions, such as disk thresholds or 
application slowdowns, and reallocates cluster resources to meet processing 
demands. This software also performs dynamic allocation of cluster resources that 
may be failing without causing the cluster to fail over. 

CICA also provides initialized cluster configurations that allow rapid cluster 
generation as well as cluster configuration builder wizards for extending the Compaq 
initialized configurations. 

CICA is included with the HA/F200 for MSA 1000 cluster kit and can be purchased 
as a stand-alone component for the HA/F100 for MSA 1000 cluster. Intelligent 
Cluster Administrator is licensed on a per-cluster basis. 

For more information, refer to the Compaq Intelligent Cluster Administrator User 
Guide Version 2.1. 

Resources for Application Installation 

The client/server software applications are among the key components of any cluster. 
Compaq is working with its key software partners to ensure that cluster-aware 
applications are available and that the applications work seamlessly on Compaq 
ProLiant clusters. 

Compaq provides a number of white papers to assist you with installing these 
applications in a Compaq ProLiant Cluster environment. 

To download current versions of these white papers and other technical documents, 
visit the Compaq High Availability website: 

www.compaq.com/highavailability 

IMPORTANT: Your software applications may need to be updated to take full advantage of 
clustering. Contact your software vendors to check whether their software supports MSCS and 
to ask whether any patches or updates are available for MSCS operation. 
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2 

Reducing Single Points of Failure and 
Enhancing High Availability Features 



This chapter discusses how to reduce single points of failure in the HA/F100 
for MSA 1000 cluster and enhanced high availability features of the HA/F200 for 
MS A 1000 cluster. 

In addition to reading this chapter, read the planning chapter in the Microsoft 
documentation that came with your operating system. 
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Reducing Single Points of Failure in the HA/F100 for 
MSA1000 Configuration 

Depending on your needs, you may leave all vulnerable areas alone, accepting the 
risk associated with a potential failure. Or, if the risk of failure is unacceptable for a 
given area, you may elect to use a redundant component to minimize or remove the 
single point of failure. 

NOTE: Although not specifically covered in this section, redundant server components (such 
as power supplies and processor modules) should be used wherever possible. These features 
will vary based on your specific server model. 

The single points of failure described in this section are: 

• Cluster interconnect 

• Fibre Channel data paths 

• Nonshared disk drives 

• Shared disk drives 

NOTE: The Compaq ProLiant Cluster HA/F200 for MSA1000 addresses the single points of 
failure previously listed with its dual-redundant loop configuration. For more information, refer 
to the "Enhanced High Availability Features of the HA/F200 for MSA1000" section of this 
chapter. 

Cluster Interconnect 

The interconnect is the primary means for the cluster nodes to communicate. 
Intracluster communication is crucial to the health of the cluster. If communication 
between the cluster nodes ceases, the Microsoft clustering software must determine 
the state of the cluster and take action, in most cases bringing the cluster groups 
offline on one of the nodes and failing over all cluster groups to the other node. 

Following are two strategies for increasing the availability of intracluster 
communication. Combined, these strategies provide even more redundancy. 
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Microsoft Clustering Software Configuration 

Microsoft Cluster Server for Windows Enterprise Edition and Cluster Service for 
Windows 2000 Advanced Server (MSCS) allow you to configure a primary and 
backup path for intracluster communication that will reduce the possibility of an 
intracluster communication disruption. Any network interface card (NIC) in the 
nodes can be configured to serve as a backup path for node-to-node communication. 
When the primary path is disrupted, the transfer of communication responsibilities is 
undetected by applications running on the cluster. Whether a dedicated or public 
interconnect has been set up, a separate NIC should be configured to act as a 
redundant interconnect. This is an easy and inexpensive way to add redundancy to 
intracluster communication. 

Redundant Interconnect Card 

Another strategy to increase availability is to use a redundant interconnect card. This 
may be done for either the dedicated intracluster communication path, or for the 
client LAN. If you are using a dedicated, direct-connection interconnect 
configuration, you can install a second, dedicated, direct-connection interconnect. 

There are two implementations that provide identical redundant NIC capability. The 
implementation you choose will depend on your hardware: 

• Compaq TLAN Teaming and Configuration Utility — This utility is supported on 
all Compaq Tl-based Ethernet and Fast Ethernet NICs, such as NetFlex-3 and 
Netelligent 10/100 TX PCI Ethernet NICs. 

• Compaq Network Teaming and Configuration Utility — This utility is designed to 
operate with the Compaq Intel-based 10/100 NICs. 

Combining these utilities with the appropriate NICs will enable a seamless, 
undetectable failover of the primary interconnect to the redundant interconnect. 

NOTE: These two methods of NIC redundancy cannot be combined in a single redundant NIC 
pair: Tl-based NICs may not be paired with Intel-based NICs to create a redundant pair. 
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Because the purpose of the redundant interconnect is to increase the availability of 
the cluster, it is important to monitor the status of your redundant NICs. Compaq 
Insight Manager, Compaq Insight Manager XE, and Compaq Insight Manager 7 
simplify management of the interconnect by monitoring the state of the NIC. You can 
view status information and alert conditions for all cards in each node. If a failover 
event occurs because of a disruption in the heartbeat, you can use the Compaq 
Insight Manager tools to determine where the disruption originated. 

Cluster-to-LAN Communication 

Each cluster node must have at least one NIC that connects to the LAN. Through this 
connection, network clients can access applications and data on the cluster. If the 
LAN NIC fails in one of the nodes, any clients connected directly to the cluster node 
by means of the computer name, cluster node IP address, or MAC address of the NIC 
will no longer have access to their applications. Clients connected to a virtual server 
on the cluster (through the IP address or network name of a cluster group) reconnect 
to the cluster through the surviving cluster node. 

Failure of a LAN NIC in a cluster node may have serious repercussions. If your 
cluster is configured with a dedicated interconnect and a single LAN NIC, the failure 
of a LAN NIC will prevent network clients from accessing cluster groups running on 
that node. If the interconnect path is not disrupted, it is possible that a failover will 
not occur. The applications will continue to run on the node with the failed NIC; 
however, clients will be unable to access them. 

Install redundant NICs and use the proper redundant NIC utility to reduce the 
possibility of LAN NIC failure. When your cluster nodes are configured with the 
utility, the redundant NIC automatically takes over operation if the primary NIC fails. 
Clients maintain their connection with their primary node and, without disruption, 
continue to have access to their applications. 

Compaq offers a dual-port NIC that can use the Compaq Redundant NIC Utility, thus 
reducing the possibility of the failure scenario previously described. However, if the 
entire NIC or the NIC slot on the node fails, the same failure scenario will occur. 

Compaq Insight Manager, Compaq Insight Manager XE, and Compaq 
Insight Manager 7 monitor the health of any network cards used for the LAN. If any 
of the cards experience a fault, the Compaq Insight Manager tools mark the card as 
"Offline" and change its condition to the appropriate status. 
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Recommended Cluster Communication Strategy 

The previous two sections discussed the redundancy of intracluster and 
cluster-to-LAN communication. However, to obtain the most benefit while 
minimizing cost and complexity, Compaq recommends viewing cluster 
communications as a single entity. 

To create redundancy for both intracluster and cluster-to-LAN communication: 

1 . Employ physical hardware redundancy for the LAN NICs. 

2. Configure the Microsoft clustering software to use both the primary and 
redundant LAN NIC as backup for intracluster communication. 

With this strategy, your cluster can continue normal operations (without a failover 
event) if these points of failure are encountered: 

• Failure of the interconnect card 

• Failure of the interconnect cable 

• Failure of the port on the LAN NIC 

• Failure of the LAN NIC (if redundant NICs, as opposed to dual-ported NICs, 
are used) 

• Failure of the Ethernet cable running from a cluster node to the Ethernet hub 
(which connects to the LAN) 

The following examples describe how to physically set up your cluster nodes to 
employ the strategy recommended by Compaq. 

Example 1 

A Compaq dual-port NIC and a single-port NIC are used in this example 
(Figure 2-1). The first port of the dual-port NIC is a dedicated interconnect, and the 
second port is the backup path for the cluster-to-LAN network. The single-port NIC 
is configured as the primary network path for cluster-to-LAN communication. 
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The TLAN Teaming and Configuration Utility (for ThunderLAN NICs) and the 
Network Teaming and Configuration Utility (for Intel NICs) are used to configure 
the second port on the dual-port NIC as the backup port of a redundant pair. The 
single port on the other NIC is configured to be the primary port for cluster-to-LAN 
communication. 



The interconnect retains its fully redundant status when MSCS is configured to use 
the other network ports as an interconnect backup. Failure of the primary 
interconnect path results in intracluster communications occurring over the single- 
port NIC because the single-port NIC was configured in MSCS as the backup for 
intracluster communication. If the entire dual-port NIC fails, the cluster nodes still 
have a working communication path over the single-port NIC. 

With this configuration, even a failure of the dual-port NIC results in the transfer of 
the cluster-to-LAN communication to the single-port NIC. Other than a failure of the 
network hub, the failure of any cluster network component will be resolved by the 
redundancy of this configuration. 
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Example 2 



The second example configuration consists of three single-port NICs (Figure 2-2). 
One NIC is dedicated to intracluster communication. The other two NICs are used for 
cluster-to-LAN communication. The Compaq Advanced Network Control Utility is 
used to configure two of the NICs — one as the primary and one as the standby of a 
redundant pair. 

The interconnect is fully redundant when the Microsoft clustering software is 
configured to use the other network cards as backups for the interconnect. Failure of 
the primary interconnect path results in intracluster communications occurring over 
the primary NIC of the redundant pair. If the entire interconnect card fails, the cluster 
nodes will still have a working communication path. 

The cluster-to-LAN communication is fully redundant up to the network hub. With 
this configuration, even a failure of the primary NIC results only in the transfer of the 
network path to the standby NIC. Other than a failure of the network hub, any failure 
of any cluster network component will be resolved by the redundancy of this 
configuration. 

The primary disadvantage of this configuration as compared to Example 1 is that an 
additional card slot is used by the third NIC. 
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HA/F100 for MSA1000 Fibre Channel Data Paths 

The Compaq Storage Works Modular SAN Array 1000 storage system is the 
mechanism with which ProLiant Clusters implement shared storage. Generally, the 
storage system consists of a host bus adapter in each server, a storage hub or switch, a 
Compaq Storage Works Modular SAN Array 1000 Controller, and a Compaq 
Storage Works Modular SAN Array 1000 (MSA 1000) into which the SCSI disks are 
placed. 

The MSA 1000 storage system has two distinct data paths, separated by the Fibre 
Channel storage hub or Fabric switch: 

• The first data path runs from the host bus adapters in the servers to the Fibre 
Channel storage hub or Fabric switch. 

• The second data path runs from the Fibre Channel storage hub or Fabric switch to 
the MSA 1000 storage system. 

The effects of a failure vary depending on whether the failure occurs on the first or 
second data path. 
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Failure of the Host Bus Adapter-to-Storage Hub Data Path 

If the host bus adapter-to-storage hub path fails (Figure 2-3), it results in a failover of 
all applications. For instance, if one server can no longer access the storage hub (and 
by extension the shared storage), all of the cluster groups that depend on shared 
storage will fail over to the second server. The cost of failure is relatively minor. It is 
the downtime experienced by users while the failover event occurs. 
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Figure 2-3: Host bus adapter-to-storage hub data path 



Note that the Compaq Insight Manager tools monitor the health of the MSA 1000 
storage system. If any part of the Fibre Channel data path disrupts a server's access to 
the MSA1000, the array controller status changes to "Failed" and the condition is red. 
The red condition moves up to higher-level Compaq Insight Manager screens and 
eventually to the device list. 

NOTE: The Compaq Insight Manager tools display a failure of physical hardware through the 
Mass Storage button on the View screen, marking the hardware "Failed." A logical drive in the 
cluster is reported on the Cluster Shared Resources screen as a logical disk resource. 
Compaq Insight Manager, Compaq Insight Manager XE, and Compaq Insight Manager 7 do 
not associate the logical drive with the physical hardware. 
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Failure of the Hub-to-MSA1000 Data Path 



The second data path (Figure 2-4), from the storage hub to the MSA 1000 storage 
system, has more severe implications when it fails. If this data path fails, all clustered 
applications become inoperable. Even attempting to failover the applications to 
another cluster node will not gain access to the MSA 1000 storage system. 

NOTE: This failure scenario can be avoided by deploying the redundant Fibre Channel loop 
configuration of the Compaq ProLiant Cluster HA/F200 for MSA1000. 
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Figure 2-4: Hub-to-MSA1000 data path 



Without access to shared storage, clustered applications cannot reach their data or log 
files. The data, however, is unharmed and remains safely stored on the physical disks 
inside the MSA 1000 storage system. If a database application was running when this 
failure occurred, some in-progress transactions will be lost. The database will need to 
be rolled back and the in-progress transactions re-entered. 

Like the server-to-storage hub data path, the Compaq Insight Manager tools detect 
this fault, change the MSA1000 status to "Failed," and change its condition to red. 
The red condition moves up through Compaq Insight Manager screens, eventually to 
the device list. 



2-10 



Compaq ProLiant Clusters HA/F100 and HA/F200 for MSA1000 Administrator Guide 



Reducing Single Points of Failure and Enhancing High Availability Features 



Nonshared Disk Drives 

Nonshared disk drives, or local storage, operate the same way in a cluster as they do 
in a single-server environment. These drives can be in the server drive bays or in an 
external storage cabinet. As long as they are not accessible by both servers, they are 
considered nonshared. 

Treat nonshared drives in a clustered environment as you would in a nonclustered 
environment. Most likely, some form of RAID is used to protect the drives and 
restore a failed drive. Because the operating system is stored on these drives, use 
either hardware or software RAID to protect the information. Hardware RAID is 
available with the Compaq Smart Array series controller or by using a nonshared 
storage system. 

Shared Disk Drives 

Shared disk drives are contained in the MSA 1000 storage system, which can be 
accessed by each cluster node. Use the Compaq Array Configuration Utility (ACU) 
to configure hardware RAID 1+0, 5, or Advanced Data Guarding (ADG) on all of 
your shared disk drives. 

If you do not use RAID 1+0, 5, or ADG, failure of a shared disk drive will disrupt 
service to all clustered applications and services that depend on the drive. Failover of 
a cluster node will not resolve this failure because neither server can read from a 
failed drive. 

NOTE: Windows Enterprise Edition or Windows 2000 Advanced Server software RAID is not 
available for shared drives when using MSCS. Hardware RAID is the only available RAID 
option for shared storage. 

As with other system failures, Compaq Insight Manager monitors the health of disk 
drives and marks a failed drive as "Failed." 
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Enhanced High Availability Features of the HA/F200 
forMSAIOOO 

A single point of failure refers to any component in the system that, should it fail, 
prevents the system from functioning. Single points of failure in hardware can be 
minimized, and in some cases eliminated, by using redundant components. The most 
effective way of accomplishing this is by clustering. 

The HA/F100 for MSA 1000 cluster reduces the single points of failure that exist in a 
single-server environment by allowing two servers to share storage and take over for 
each other in the event that one server fails. The HA/F200 for MSA1000 cluster goes 
one step further by implementing a dual-redundant Fibre Channel Arbitrated Loop 
configuration. 

The HA/F200 for MSA 1000 cluster further enhances high availability through the 
use of additional, redundant components in the server-to-storage connection and in 
the shared storage system itself. In the event of a failure, processing is switched to an 
alternate path without affecting applications and end users. In fact, this path switch is 
transparent even to the Windows NT and Windows 2000 file system (NTFS). The 
combination of multiple paths and redundant hardware components provided by the 
HA/F200 for MSA 1000 cluster offers significantly enhanced high availability over 
nonredundant configurations. 

A single component failure in the HA/F200 for MSA 1000 cluster results in an 
automatic failover to an alternate component, allowing end users to continue 
accessing their applications without interruption. Some typical failures and associated 
responses in an HA/F200 for MSA1000 configuration are: 

• A server failure causes the Microsoft clustering software to fail application 
processing over to the second server. 

• A host bus adapter failure causes I/O requests intended for the failed adapter to 
be rerouted through the redundant adapter. 

• A storage hub, switch, or cable failure is treated like a host bus adapter failure 
and a failover to the second host bus adapter, which is using a different storage 
hub and cables, occurs. 

• An array controller failure causes the redundant array controller to take over for 
the failed controller. 
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In all of the previous examples, end users will experience minimal interruptions 
while the failover occurs. In some cases, the interruptions might not even be 
noticeable. 

The following illustration depicts the HA/F200 for MSA 1000 configuration 
components. 



The MSA 1000 storage system is the mechanism with which the HA/F200 cluster 
implements shared storage. The HA/F200 for MSA 1000 cluster minimum 
configuration consists of two host bus adapters in each server, two Fibre Channel 
storage hubs or Fabric switches, two array controllers per MSA1000 storage system, 
and one or more MSA1000 storage systems. 



Compaq ProLiant Clusters HA/F100 and HA/F200 for MSA1000 Administrator Guide 2-13 




LAN 



Figure 2-5: HA/F200 configuration 



HA/F200 Fibre Channel Data Paths 



Reducing Single Points of Failure and Enhancing High Availability Features 



The MSA 1000 storage system has active and standby data paths, separated by two 
Fibre Channel storage hubs or Fabric switches. Figure 2-6 and Figure 2-7 detail the 
active and standby paths of the minimum HA/F200 for MSA 1000 configuration. 
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Figure 2-6: 
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Active host bus adapter-to-storage data 



2-14 



Compaq ProLiant Clusters HA/F100 and HA/F200 for MSA1000 Administrator Guide 



Reducing Single Points of Failure and Enhancing High Availability Features 



The active data paths run from the active host bus adapters in the servers to the active 
storage hub. If this path fails, the applications can seamlessly fail over to the standby 
host bus adapter-to-storage hub data paths (Figure 2-7). 



■□A 

OS 



AD 
SD 



Server 



Server 



storage hub storage hub 

or switch or switch 



Active 



Standby 



MSA1000 



Figure 2-7: Active hub-to-storage data path 
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Reducing Single Points of Failure and Enhancing High Availability Features 



The second active data path runs from the active hub or switch to the MSA 1000 
storage system. If this path fails, the applications can seamlessly fail over to the 
standby hub-to-MS A 1 000 data path. 

The dual-redundant loop feature of the HA/F200 for MSA 1000 cluster increases the 
level of availability over clusters that have only one path to the shared storage. In 
addition, the second path in the HA/F200 for MSA 1000 cluster provides for 
improved performance through static load balancing. 
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Setting Up the Compaq ProLiant Clusters 
HA/F100 and HA/F200 for MSA1000 



Preinstallation Overview 

This chapter provides instructions for building a new Compaq ProLiant 
Cluster HA/F100 or a Compaq ProLiant Cluster HA/F200 for MSA1000. 

If you are planning to upgrade from an HA/F100 for MSA1000 to an HA/F200 for 
MSA1000 configuration, see Chapter 4 for more details. 

If you are planning to migrate the operating system of an HA/F100 for MSA 1000 or 
HA/F200 for MSA 1000, see Chapter 5 for more details. 

The Compaq ProLiant Clusters HA/F100 and HA/F200 for MSA1000 are 
combinations of several individually available products. Have the following 
documents available as you set up your cluster: 

• Documentation for the clustered Compaq ProLiant servers 

• Compaq shared external storage documentation 

• Compaq host bus adapter documentation 

• Installation guide for the NIC of your choice 

• Installation guide for the Compaq Fibre Channel storage hub or Fabric switch of 
your choice 
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• Documentation received with your operating system 

— Microsoft Windows NT Server 4.0, Enterprise Edition (Windows Enterprise 
Edition) 

— Microsoft Windows 2000 Advanced Server 

• Compaq SmartStart for Servers Setup Poster 

• Compaq Insight Manager Installation Poster 

• Compaq Intelligent Cluster Administrator Quick Setup Guide 

• Microsoft clustering documentation 

The installation and setup of your ProLiant Cluster is described in the following 
sections: 

• Preinstallation guidelines 

• Installing the hardware, including: 

— Cluster nodes 

— MSA 1000 storage system 

— Cluster interconnect 

• Installing the software, including: 

— Compaq SmartStart for Servers 

— Microsoft Windows 2000 Advanced Server or Microsoft Windows NT 
Server 4.0, Enterprise Edition 

— Compaq SAN works Secure Path Version 3. IB for Windows Workgroup 
Edition 

— Compaq Insight Manager (optional) 

— Compaq Insight Manager XE (optional) 

— Compaq Insight Manager 7 (optional) 

— Compaq Intelligent Cluster Administrator (optional) 
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• Additional cluster verification steps, including: 

— Verifying creation of the cluster 

— Verifying node failover 

— Verifying network client failover 

These installation and configuration steps are described in the following sections. 

Preinstallation Guidelines 

Before installing MSCS on a cluster node, write down the answers to the following 
questions: 

• Are you forming or joining a cluster? 

• What is the cluster name? 

• What is the user name, password, and domain for the domain account that MSCS 
will run under? 

• What disks will you use for shared storage? 

• Which shared disk will you use to store permanent cluster files? 

• What are the adapter names and IP addresses of the network adapter cards you 
will use for client access to the cluster? 

• What are the adapter names and IP addresses of the network adapter cards you 
will use for the dedicated interconnect between the cluster nodes? 

• What is the IP address and subnet mask of the address you will use to administer 
the cluster? 

• What are the slot numbers of the controllers to be managed by the cluster? 

Installing clustering software requires several specific steps and guidelines that may 
not be necessary when installing software on a single server. Read and understand the 
following items before proceeding with any software installation: 

• Ensure that you have sufficient software licensing rights to install the Microsoft 
Windows operating system and software applications on each server. 
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• Ensure that the Fibre Channel storage hub or Fabric switch has AC power. 

• Power up the MSA 1000 storage system before the cluster nodes are powered up. 

• Log on to the domain using an account that has administrative permissions on 
both cluster nodes. When installing MSCS, both cluster nodes must be in the 
same Microsoft Windows NT or Windows 2000 domain. The cluster nodes can 
be members of an existing Windows NT or Windows 2000 domain, or they can 
both be member servers. 

Windows NT users can make up their own domain by assigning one as Primary 
Domain Controller (PDC) and one as Backup Domain Controller (BDC), or they 
can both be a BDC in an existing Windows NT domain. 

• Configure the drives in the MSA 1000 storage system using the Compaq Array 
Configuration Utility (ACU), version 2.80 or later. The ACU stores the drive 
configuration information on the drives themselves. After you have configured 
the shared drives from one of the cluster nodes it is not necessary to configure the 
drives from the other cluster node. 

When the ACU runs on the first cluster node, configure the shared drives in the 
MSA1000. When the utility is run on the second cluster node, it will display 
information on the shared drives that was entered when the ACU was run on the 
first node. Accept the information as displayed and continue. 

• For a manual software installation, use Disk Administrator (Windows Enterprise 
Edition) or Disk Management (Windows 2000 Advanced Server) on the first 
cluster node to configure the shared drives, and allow MSCS to synchronize 
information between the two nodes. 

By running Disk Administrator or Disk Management from the first node, you 
prevent potential problems caused by inconsistent drive configurations. When the 
second cluster node joins the cluster, the disk information in the Windows 
Registry is copied from the first node to the second node. 

• Only New Technology File System (NTFS) is supported on shared drives. 

• Only basic disks are supported on shared drives. Dynamic disks are not 
supported. 
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• Assigning permanent drive letters to your shared drives is necessary because 
MSCS software requires drive letters to remain constant throughout the life of the 
cluster. If you are performing manual software installation, use Disk 
Administrator or Disk Management to assign permanent drive letters. 

Microsoft Windows Enterprise Edition or Windows 2000 Advanced Server 
makes dynamic drive letter assignments (when drives are added or removed, or 
when the boot order of drive controllers is changed), but Disk Administrator or 
Disk Management allows you to make permanent drive letter assignments. 

• Ensure that cluster nodes are members of only one cluster. 

• When you set up the cluster interconnect, select TCP/IP as the network protocol. 
MSCS requires the TCP/IP protocol. The cluster interconnect must be on its own 
subnet. The IP addresses of the interconnects must be static, not dynamically 
assigned by DHCP. 

Installing the Hardware 

The following installation steps detail a new installation and setup of an HA/F100 or 
HA/F200 for MSA 1 000 cluster. 

Setting Up the Nodes 

Physically preparing the nodes (servers) for use in a cluster is similar to preparing 
them for individual use. The primary difference is in setting up the shared storage: 

1. Install all necessary adapter cards and insert all internal hard drives. 

2. Attach network cables and plug in SCSI and/or Fibre Channel cables. 

3. Set up one node completely, then set up the second node. 

IMPORTANT: Do not load any software on either cluster node until all the hardware has been 
installed in both cluster nodes. 

NOTE: Compaq recommends that Automatic Server Recovery (ASR) be left at the default 
values for clustered servers. 



Compaq ProLiant Clusters HA/F100 and HA/F200 for MSA1000 Administrator Guide 



3-5 



Setting Up the Compaq ProLiant Clusters HA/F100 and HA/F200 for MSA 1000 



Follow the installation instructions in your Compaq ProLiant Server documentation 
to set up the hardware. To install Compaq Storage Works FCA-2101 host bus adapter 
and any NICs, follow the instructions in the following sections. 

Installing the Compaq StorageWorks FCA-2101 Host Bus Adapter 

The host bus adapters, which connect the two servers to the storage through Fibre 
Channel storage hubs or Fabric switches, are installed in each server like any other 
PCI card. The HA/F100 for MSA 1000 cluster requires one host bus adapter per 
server, while the HA/F200 for MSA 1000 cluster requires two host bus adapters per 
server. The extra host bus adapter in each server contributes to the enhanced high 
availability features of the HA/F200 for MSA 1000 cluster. The dual host bus 
adapters, in conjunction with dual Fibre Channel storage hubs or Fabric switches and 
dual array controllers, form two completely independent paths to the storage, making 
the server-to-storage connection totally redundant. However, it is important to ensure 
that each host bus adapter in a particular server is connected to a different hub, 
because it is physically possible to connect the servers to the storage hubs in such a 
way that the cluster seems to be working correctly, but will not be able to fail over 
properly. 

Follow the installation instructions in your HBA documentation and your Compaq 
ProLiant server documentation to install the host bus adapter in your servers. 

NOTE: To determine the preferred slots for installing the host bus adapters, use PCI 
bus-loading techniques to balance the PCI bus for your hardware and configuration. For more 
information, refer to your server documentation and the Compaq white paper, "Where Do I 
Plug the Cable? Solving the Logical-Physical Slot Numbering Problem," available from the 
Compaq website at 

www.compaq.com/highavailability 
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The FCA-2101 host bus adapter comes equipped with two LEDs that are used to 
display the state of the host bus adapter. Table 3-1 lists the states in which you might 
find the host bus adapter after installation. 



Table 3-1 : FCA-2101 Host Bus Adapter States 



Green LED 


Yellow LED 


State 


OFF 


OFF 


Wake-UD failure (dead board} 


OFF 


ON 


POST failure (dead board) 


OFF 


Slow blink (1 Hz) 


Wake-up failure (dead board) 


OFF 


Fast blink (4 Hz) 


Failure in POST (dead board) 


OFF 


Flashing (irregular) 


POST processing in progress 


ON 


OFF 


Failure while functioning 


ON 


ON 


Failure while functioning 


ON 


Slow blink (1 Hz) 


Normal for the 1-Gb link rate 


ON 


Fast blink (4 Hz) 


Normal for the 2-Gb link rate 


Slow blink (1 Hz) 


OFF 


Normal when the link is down or not yet 
started 


Slow blink (1 Hz) 


ON 


Not defined 


Slow blink (1 Hz) 


Slow blink (1 Hz) 


Offline for download 


Slow blink (1 Hz) 


Fast blink (4 Hz) 


Restricted offline mode (waiting for 
restart) 


Slow blink (1 Hz) 


Flashing (irregular) 


Restricted offline mode, test active 


Fast blink (4 Hz) 


OFF 


Debug monitor in restricted mode 


Fast blink (4 Hz) 


ON 


Not defined 


Fast blink (4 Hz) 


Slow blink (1 Hz) 


Debug monitor in test fixture mode 


Fast blink (4 Hz) 


Fast blink (4 Hz) 


Debug monitor in remote debug mode 


Fast blink (4 Hz) 


Flashing (irregular) 


Debug monitor output active 
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Installing the Cluster Interconnect 

There are a number of methods to physically set up an interconnect. See Chapter 1 
for a description of the types of interconnect strategies. 

If you are using a dedicated interconnect, install an interconnect adapter card in each 
cluster node. If you are sharing your LAN NIC with your interconnect, install the 
LAN NIC. 

NOTE: To determine the preferred slot for installing the interconnect card, use PCI 
bus-loading techniques to balance the PCI bus for your hardware and configuration. For more 
information, see your server documentation and the Compaq white paper, "Where Do I Plug 
the Cable? Solving the Logical-Physical Slot Numbering Problem," available from the Compaq 
website: 

www.compaq.com 

For specific instructions on how to install an adapter card, refer to the documentation 
for the interconnect card that you are installing or the Compaq ProLiant server you 
are using. The cabling of interconnects is outlined later in this chapter. 

Setting Up the Compaq StorageWorks Modular SAN Array 
1000 Storage System 

Follow the instructions in the Compaq shared external storage documentation to set 
up the MSA 1000 storage system, the Compaq StorageWorks Fibre Channel Storage 
Hub 7, supported Compaq StorageWorks switches, the Compaq StorageWorks 
MSA 1 000 Controller, and the Fibre Channel cables. 
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The Compaq shared external storage documentation explains how to install these 
devices for a single server. Because clustering requires shared storage, you need to 
install these devices for two servers. You will need to run an extra Fibre Channel 
cable from the Fibre Channel storage hub or Fabric switch to the second server 
(Figure 3-1). 



MSA1000 



storage hub 
or switch 



Dedicated Interconnect 



Node 1 



Node 2 



LAN 



Figure 3-1 : MSA1000 storage connected to clustered 
servers in the HA/F100 for MSA1000 configuration 



Powering Up 

Before applying power to the MSA 1000 storage system, ensure that all components 
are installed and connected to the Fibre Channel storage hub or Fabric switch. 

Power up the cluster components in the following order: 

1 . Fibre Channel storage hubs or Fabric switches 

2. Storage systems 

3. Servers 
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Configuring Shared Storage 

The ACU sets up the hardware aspects of any drives attached to an array controller, 
including the drives in the shared MSA1000 storage systems. The ACU can initially 
configure the array controller, reconfigure the array controller, add additional disk 
drives to an existing configuration, and expand capacity. The ACU stores the drive 
configuration information on the drives themselves; therefore, after you have 
configured the drives from one of the cluster nodes, it is not necessary to configure 
the drives from the other cluster node. 

For detailed information about configuring the drives, refer to the section on the ACU 
in the Compaq shared external storage documentation. 

Setting Up a Dedicated Interconnect 

There are two ways to set up a dedicated interconnect: 

• Ethernet direct connect 

• Ethernet direct connect using a private hub or switch 

Ethernet Direct Connect 

An Ethernet crossover cable is included with your Compaq ProLiant cluster kit. This 
cable directly connects two NIC that have been designated as the dedicated 
interconnect. Connect one end of the cable to the NIC in node 1 and the other end of 
the cable to the NIC in node 2. 

IMPORTANT: Connect the cable to the dedicated interconnect NICs and not to the Ethernet 
connections used for the network clients (the public LAN). 

NOTE: The crossover cable will not work in conjunction with a network hub or switch. 

After the installation of the dedicated interconnect is complete, assign static IP 
addresses to both NICs. Compaq recommends using addresses that adhere to 
RFC 1918, Address Allocation for Private Internets. 
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Table 3-2 provides the address range based on class. 



Table 3-2: RFC 1918 Address Ranges 


Class 


Address Range 


A/8 


10.0.0.0 to 10.255.255.255 


B/16 


172.16.0.0 to 172.31.255.255 


C/24 


192.168.0.0 to 192.168.255.255 



Ethernet Direct Connect Using a Private Hub or Switch 

An Ethernet hub or switch requires standard Ethernet cables; Ethernet crossover 
cables will not work with a hub or switch. To cable the server interconnect using an 
Ethernet hub or switch: 

1 . Connect the end of one of the Ethernet cables to the NIC in node 1 . 

2. Connect the other end of the cable to a port in the hub or switch. 

3. Repeat steps 1 and 2 for the NIC in node 2. 

4. Assign static IP addresses. 

IMPORTANT: Place the cable into the dedicated interconnect NICs and not into the Ethernet 
connections used for the network clients (the public LAN). 

Setting Up a Public Interconnect 

It is possible — but not recommended — to use a public network as your dedicated 
interconnect path. To set up a public Ethernet interconnect, connect the NICs, hub, 
and cables as you would in a nonclustered environment. Then configure the NICs for 
both network clients and for the dedicated interconnect. 
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Compaq recommends using static IP addresses on the public LAN. Do not use 
DHCP-assigned addresses. 

IMPORTANT: Using a public network as your dedicated interconnect path is not 
recommended because it represents a potential single point of failure for cluster 
communication. 



Redundant Interconnect 

MSCS allows you to configure any certified network card as a possible path for 
intracluster communication. If you are employing a dedicated interconnect, use 
MSCS to configure your LAN network cards to serve as a backup for your 
interconnect. 

See the "Recommended Cluster Communication Strategy" section in Chapter 2 of 
this guide for more information about setting up redundancy for intracluster and 
cluster-to-LAN communication. 

Installing the Software 

The following sections describe the software installation steps for the HA/F100 and 
the HA/F200 for MSA1000 clusters. Proceed with these steps after you have all 
equipment installed and your hubs or switches, storage system, and one server 
powered up. 

You will need the following during installation: 

IMPORTANT: See to Appendix A for the software and firmware version levels your cluster 
requires. 

• Compaq SmartStart and Support Software 

• Compaq SmartStart Setup Poster 

• Server Profile Diskette (included with SmartStart) 

• Compaq Modular SAN Array 1000 Support Software CD 
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• One of the following operating systems: 

— Microsoft Windows Enterprise Edition software and documentation 

— Microsoft Windows 2000 Advanced Server software and documentation 

• Microsoft Service Packs 

• Compaq SAN works Secure Path Version 3. IB for Windows Workgroup Edition 
(HA/F200 for MSA1000 cluster only) 

• Monitoring and Management Software 

— Compaq Insight Manager software and documentation 

— Compaq Insight Manager XE software and documentation 

— Compaq Insight Manager 7 software and documentation 

— Compaq Intelligent Cluster Administrator software and documentation 

• At least 10 high-density diskettes 

Assisted Integration Using SmartStart (Recommended) 

IMPORTANT: Prior to the installation of Microsoft Windows 2000 Advanced Server, upgrade 
the system ROM on each node with the latest systems ROMPaq from the Compaq website 
(www.compaq.com/support). 

Use the SmartStart Assisted Integration procedure to configure the nodes (servers) in 
the HA/F100 and HA/F200 for MSA1000 configurations. You will set up two nodes 
during this process. Proceed through all of the steps on each of the nodes, with noted 
exceptions. 



A CAUTION: Installation using SmartStart assumes that SmartStart is being installed 
on new servers. Any existing data on the servers' boot drive will be erased. 
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Cluster-Specific SmartStart Installation 

The SmartStart Setup Poster describes the typical procedure for configuring and 
installing software on a single server. The installation for HA/F100 and HA/F200 for 
MSA 1000 clusters is similar. The difference between running SmartStart on a stand- 
alone server and running SmartStart for a cluster is noted below: 

• Through the ACU, you can configure the shared drives on both servers. For 
cluster configuration, configure the drives on the first server, then accept the 
same settings for the shared drives when given the option on the second server. 

• When configuring drives through the ACU, create a logical drive with 500 MB of 
space to be used as the quorum disk. 

Assisted Integration Installation Steps 

IMPORTANT: Power down node 2 when setting up node 1 . 

1 . Power up your hardware in the following order: 

a. Fibre Channel storage hub or Fabric switch 

b. Shared storage (wait for drives to spin up) 

c. Node 1 

2. Insert the SmartStart CD in the CD-ROM drive. The CD will automatically run. 

3. Select the Assisted Integration installation path. Follow steps outlined in the 
SmartStart Setup Poster. 

4. Select one of the following when SmartStart prompts for the operating system: 

— Microsoft Windows NT Server 4.0/Enterprise Edition (Retail) 

— Microsoft Windows NT Server 4.0/Enterprise Edition (Select) 

— Microsoft Windows 2000 Advanced Server (Retail) 

— Microsoft Windows 2000 Advanced Server (Select) 
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5. Press Enter after the hardware configuration utility has run. SmartStart will 
automatically run the ACU for drives connected to an internal array controller. 

After you have completed using the ACU, the system will reboot and SmartStart 
will automatically create your system partition. 

NOTE: If the node being configured has an array controller attached to the server-centric hard 
drives, the server array controller will also need to be configured at this time. 

6. Install additional Compaq software and utilities and choose the boot partition. 

7. Create the Options ROMPaq™ in the Diskette Builder Utility. Label the diskettes 
you create. The Options ROMPaq updates the firmware on the server array 
controllers and the hard drives. 

The node will reboot to prepare for the operating system installation. 

IMPORTANT: In step 7, when configuring node 2, click Next to exit the Diskette Builder Utility 
and go to step 8. 

8. Insert the Microsoft Windows CD when prompted. Install the appropriate 
Service Pack: 

— If installing Windows Enterprise Edition, install Service Pack 6a when 
prompted for a Service Pack. After the Service Pack is installed, the node 
reboots and Enterprise Edition Installer loads automatically. Exit the 
Enterprise Edition Installer. 

— If installing Windows 2000, install Service Pack 2. 

9. Insert the Compaq Modular SAN Array Support Software CD. Click Install 
Online Array Configuration Utility. 

10. After ACU is installed, select the appropriate option to install the HBA drivers 
for your operating system. The following drivers are installed during this process: 

— Compaq Storage Works FCA-2101 driver 

— Compaq Storage Works Modular SAN Array 1000 Controller driver 

— Compaq Storage Works Modular SAN Array Fabric Switch 6 driver 
Reboot the system after the drivers are installed. 

11. Run the ACU from within the installed Windows operating system. 
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12. Choose the custom configuration option to create RAID sets on your MSA 1000 
storage system. Refer to the user guide for the MSA 1000 storage system for more 
details. 

IMPORTANT: Node 2 exception: In step 12 when configuring node 2, the ACU shows the 
results of the shared drives configured during node 1 setup. Accept these changes for node 2 
by exiting the ACU. 

NOTE: Create a logical drive on one of the MSA1000 Arrays with 500 MB of space to be used 
as the quorum drive. 

13. Open the Disk Administrator for Windows Enterprise Edition or Disk 
Management for Windows 2000 Advanced Server. If prompted for drive 
signature stamp, choose Yes. If prompted to upgrade disks, choose No because 
MSCS does not support dynamic disks in a cluster. 

14. If configuring an HA/F200 for MSA1000 cluster, install Secure Path: 

a. During the installation, you will be prompted to configure your clients. Insert 
the Secure Path CD to automatically start the Secure Path installation 
process. Alternatively, double-click the following file on the CD: 

<CD-ROM drive> : \SPInstall\setup . exe 

b. Remove the Compaq SANworks Secure Path CD from the CD-ROM drive. 

c. Reboot the node when prompted. 

To use Secure Path, click Start, Programs, SecurePath, SPM. 

NOTE: If you have problems authorizing client connections using Fully Qualified Domain 
Names (FQDN), it may be due to a Domain Name Service (DNS) resolution issue, and can be 
resolved by a HOSTs file entry containing relevant FQDN-to-IP address mapping. 

For more detailed information on Secure Path, refer to the Secure Path 
documentation. 
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15. Install MSCS for node 1. 

— For Windows Enterprise Edition, open the Enterprise Edition Installer and 
install MSCS on both cluster nodes as outlined in MSCS documentation. 

— For Windows 2000 Advanced Server, install the Cluster Service (MSCS) 
component in Add/Remove Programs. For more information on installing 
and configuring MSCS, refer to your Windows 2000 Advanced Server 
documentation. 

16. Install the Microsoft Service Packs on node 1. 

— If installing Windows Enterprise Edition, install Microsoft Windows NT 
Service Pack 6a after cluster installation is complete. 

— If installing Microsoft Windows 2000 Advanced Server, install Microsoft 
Windows 2000 Service Pack 2 after cluster installation is complete. 

For the latest information on Service Packs, refer to your Microsoft provider 
or the Microsoft website at 

www. m icrosoft.com 

17. Run the appropriate support software. 

— For Microsoft Windows Enterprise Edition, run Compaq Support Paq for 
Microsoft Windows NT to verify that all installed drivers are current. This 
service can be run from the following path on the SmartStart CD: 

x : \cpqsupsw\ntcsp\setup . exe 

— For Microsoft Windows 2000 Advanced Server, run Compaq Support Paq 
for Windows 2000 to verify that all installed drivers are current. This service 
can be run from the following path on the SmartStart CD: 

x : \cpqsupsw\ntcsp\setup . exe 

For the latest versions of the support software for Microsoft Windows Enterprise 
Edition, or Microsoft Windows 2000 Advanced Server, refer to the Compaq 
support website at 

www.compaq.com/support 
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18. Power on node 2 and repeat steps 2 through 17. 

19. Install your applications and managing and monitoring software. Refer to the 
Compaq Insight Manager Installation Poster for information on installing 
Compaq Insight Manager on the management console and Insight Management 
Agents on servers and desktops. 

Compaq Intelligent Cluster Administrator CD is located in your HA/F200 for 
MSA 1000 cluster kit and is available as an orderable option for the HA/F100 for 
MSA 1000 cluster. Installation steps for installing Compaq Intelligent Cluster 
Administrator can be found later in this chapter and in the Compaq Intelligent 
Cluster Administrator Quick Setup Guide. 

Compaq Intelligent Cluster Administrator 

Compaq Intelligent Cluster Administrator (CICA) supports a variety of preconfigured 
cluster options. These options can be initialized on your cluster if you have the 
appropriate software installed. After you install CICA you can select from a menu of 
preconfigured cluster configurations, and they will automatically be applied to your 
cluster. 

The Compaq Intelligent Cluster Administrator User Guide Version 2.1 and CD are 
located in your HA/F200 for MSA1000 cluster kit. If you are setting up an HA/F100 
for MSA 1000 configuration, you can order CICA separately. 

Installing Compaq Intelligent Cluster Administrator 

To install CICA on your system: 

1 . Insert the Compaq Intelligent Cluster Administrator CD. 

2. Click Explore. 

3 . Double-click the CICA folder. 

4. Double-click SETUP.EXE. 

CICA begins the installation. If a previous version of the product is installed, the 
service is stopped and the new version is installed. 
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5. Double-click the Setup icon on the installation disk and follow the instructions. 
The program will be deployed into the C:\COMPAQ\CICA directory. If this 
directory does not exist, the installation program will create it. After it is 
installed, the files should not be moved. 

6. Set the effective User ID for the CICA service to the Windows NT or Windows 
2000 domain administrator user account. 

7. Repeat these steps to install the software on the other cluster node. 

For more specific instructions about using CICA, refer to the Compaq Intelligent 
Cluster Administrator User Guide Version 2.1, which is included in your HA/F200 
for MSA 1000 cluster kit, or the Compaq high availability website at 

www.compaq.com/highavailability 

Additional Cluster Verification Steps 

The following information describes several Microsoft Cluster Administrator steps 
for verifying the creation of the cluster, verifying node failover, and verifying 
network client failover. 

Verifying the Creation of the Cluster 

After you have installed the software, verify creation of the cluster: 

1 . Shut down and power down both servers. 

2. Power down and then power on the MSA1000 storage system. 

3. Power up both servers. 

When Windows finishes booting up on both servers, follow these steps to use 
Microsoft Cluster Administrator to verify creation of the cluster: 

1 . Select Start, Programs, Administrative Tools (Common), and Cluster 
Administrator from the Windows desktop on either cluster server. 
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2. Enter the name or IP address of one of the cluster nodes when you are prompted 
for cluster or server name. 

If the cluster has been created correctly, the computer names of both cluster 
nodes are displayed on the left side of the Cluster Administrator window (see 
Figure 3-2). 
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Figure 3-2: Microsoft Cluster Administrator 

If the cluster is not working correctly, see the installation troubleshooting tips in 
Chapter 7. 



Verifying Node Failover 

NOTE: Do not run any client activity while testing failover events. 



To verify failover of a cluster node: 

1 . Select Start, Programs, Administrative Tools (Common), and Cluster 
Administrator from the Windows desktop on both servers. 

2. Enter the name or IP address of one of the cluster nodes when you are prompted 
for cluster or server name. 
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3. Verify that the server owns some of the resources and groups you will be 
powering off, so that a failure event will result in failover of resources and/or 
groups. Make sure all predefined resources and groups are online. 

4. Power off one of the cluster nodes. 

Within several seconds, Microsoft Cluster Administrator will bring online all of 
the predefined resources and groups that were previously owned by the powered- 
off server. If, after a minute, nothing seems to have happened, refresh the screen 
by selecting Refresh (F5). 

If failover is not working correctly, see the installation troubleshooting tips in 
Chapter 7. 

Verifying Network Client Failover 

After you have verified that each server is correctly running as a cluster node, the 
next step is to verify that network clients can interact with the cluster. 

To verify network client failover: 

1 . Ensure that both cluster nodes are running, and verify, by means of Microsoft 
Cluster Administrator, that all groups and resources are online. 

For each hard disk in the shared storage, MSCS automatically creates a cluster 
group that consists of a single resource, the disk drive. 

2. Add an existing IP address as another resource to one of these groups using 
Microsoft Cluster Administrator. Do not use the Cluster Group. Save the changes 
and return to the main Cluster Administrator screen. 

3. Open a DOS window on a network client machine. 
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4. Ensure that the network client can access the IP address. Regardless of whether 
you are using WINS or DHCP, you can execute the ping command to check the 
connection. 

Execute a ping command from the network client, using the cluster IP address as 
the argument. 

— The client has successfully accessed the cluster resource if you get a response 
similar to: 

Reply from <IP Address>: bytes=xx time=xxxms TTL=xx 

— The client has not successfully accessed the cluster resource if you get a 
response of: 

Reply from <IP Address>: Destination host unreachable 

5. Use Microsoft Cluster Administrator to perform a manual failover of the cluster 
group that contains the IP address. 

6. Execute the ping command again after the manual failover is complete. 

As soon as the other node brings the cluster group online, a response similar to the 
one noted in step 4 should be returned. If the client successfully accessed the failed- 
over IP address, your cluster is working. If the client was unsuccessful, either the 
cluster group was not configured correctly, the failover did not occur, or the ping 
command was performed before the failover activity was completed. 

If network client failover is not working correctly, see the installation troubleshooting 
tips in Chapter 7. 

To verify a more extreme case, instead of failing over the IP address, power off the 
primary cluster node and verify that the resource fails over to the other node. 
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Upgrading the ProLiant Clusters HA/F100 
for MSA1000 and HA/F200 for MSA1000 



This chapter details the procedures for the following cluster upgrades: 

• HA/F100 for MSA1000 Windows 2000 Advanced Server to HA/F200 for 
MSA 1000 Windows 2000 Advanced Server 

• HA/F100 for MSA1000 Windows Enterprise Edition to HA/F200 for MSA1000 
Windows 2000 Advanced Server 

• HA/F100 for MSA1000 Windows Enterprise Edition to HA/F200 for MSA1000 
Windows Enterprise Edition 

This chapter assumes that you are familiar with basic cluster management and 
operation. It also assumes that you are familiar with the hardware and software 
configuration details outlined in Chapter 3 of this guide. 

Even though some of the procedures in this chapter detail how to upgrade clusters by 
loading one operating system over another, Compaq strongly recommends that you 
upgrade the cluster servers from Windows Enterprise Edition to Windows 2000 
Advanced Server using a new installation of Windows 2000 Advanced Server. Using 
a new installation of Windows 2000 Advanced Server clears all the data from the 
server's hard disks and eliminates any installation problems from loading 
Windows 2000 Advanced Server on top of Windows Enterprise Edition. 
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All of the procedures outlined in this chapter preserve the data on the cluster's 
external storage systems. On a case-by-case basis, some of the procedures allow the 
cluster to be upgraded while continuing to service clients. These upgrades are called 
"rolling upgrades." During these rolling upgrades, however, the cluster will be 
operating in a degraded state since one of the servers will be unavailable to clients. 
Other installations will require a new installation of the operating system without 
giving an option to load Windows 2000 Advanced Server over Windows Enterprise 
Edition. Read any upgrade procedure thoroughly before starting it. Some of these 
procedures require all nodes of the cluster to be turned off at the same time causing 
the cluster to be unavailable to clients during the upgrade. 

IMPORTANT: These procedures may be updated over time. For additional information on 
upgrading to Windows 2000 Advanced Server visit the Microsoft website 
(www.microsoft.com) or the Compaq High Availability website 
(www.compaq.com/highavailability). 

NOTE: These steps assume that you currently have an existing cluster in an operational 
state. 



Pre-upgrade Preparation 

A cluster can be upgraded to Windows 2000 Advanced Server without migrating the 
primary domain controller. However, if the cluster is used as a backup domain 
controller, the primary domain controller must be migrated first. For more detailed 
information on migrating domain controllers, refer to your Windows 2000 Advanced 
Server documentation included with your operating system purchase. 

When upgrading from an HA/F100 for MSA1000 cluster to an HA/F200 for 
MSA 1000 cluster, additional hardware is required. The difference between the 
clusters is the addition of a second redundant Fibre Channel loop. To complete the 
installation of this loop, the following components are required: 

• One additional Compaq Storage Works MSA 1000 Controller per storage 
subsystem 
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• One additional of the following Fibre Channel storage hub or Fabric switch 
components: 

— Compaq StorageWorks Fibre Channel Storage Hub 7 (7 -port FC-AL Hub) 

— Compaq StorageWorks Fibre Channel SAN Switch 8 (SAN Switch/8) 

— Compaq StorageWorks Fibre Channel SAN Switch 16 (SAN Switch/ 16) 

— Compaq StorageWorks Fibre Channel SAN Switch 8-EL (SAN Switch/8EL) 

— Compaq StorageWorks Fibre Channel SAN Switch 16-EL 
(SAN Switch/1 6EL) 

— Compaq StorageWorks Modular SAN Array Fabric Switch 6 
(MSA Fabric Switch 6) 

• One additional Compaq StorageWorks FCA-2101 host bus adapter 
(FCA-2101 HBA) 

• Redundancy management software: 

— Compaq SAN works Secure Path Version 3. IB for Windows Workgroup 
Edition (Secure Path) 

• Additional GBIC-SWs and Fibre Channel cables 

In addition to the previously mentioned requirements the following items are needed 
for any cluster upgrade: 

• Compaq SmartStart and Support Software 

• Compaq SmartStart for Servers Setup Poster 

• Server Profile Diskette (included with SmartStart) 

• Compaq Modular SAN Array 1000 Support Software CD 
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• One of the following operating systems and the documentation received with the 
operating system: 

— Microsoft Windows 2000 Advanced Server 

— Microsoft Windows NT Server 4.0, Enterprise Edition 

• Microsoft Service Packs 

Ensure the following documentation is available for reference: 

• Compaq SmartStart for Servers Setup poster 

• Microsoft clustering documentation 

• Compaq ProLiant servers documentation 

• Compaq shared external storage documentation 

• Compaq host bus adapter documentation 

• Fibre Channel storage hub or Fabric switch documentation 

— Compaq StorageWorks Fibre Channel Storage Hub 7 Installation Guide 

— Compaq StorageWorks Fibre Channel SAN Switch 8 Installation and 
Hardware Guide 

— Compaq StorageWorks Fibre Channel SAN Switch 16 Installation and 
Hardware Guide 

— Compaq StorageWorks Fibre Channel SAN Switch 8-EL Installation and 
Hardware Guide 

— Compaq StorageWorks Fibre Channel SAN Switch 16-EL Installation and 
Hardware Guide 

— Compaq StorageWorks Modular SAN Array Fabric Switch 6 User Guide 
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• Monitoring and management software 

— Compaq Insight Manager 

— Compaq Insight Manager XE 

— Compaq Insight Manager 7 

— Compaq Intelligent Cluster Administrator 

NOTE: See Appendix A to determine which Service Packs and software and firmware version 
levels your cluster requires. 

Upgrade Process I: HA/F100 for MSA1000 with 
Windows 2000 Advanced Server to HA/F200 for 
MSA1000 with Windows 2000 Advanced Server 

The upgrade of HA/F100 for MSA1000 with Windows 2000 Advanced Server to 
HA/F200 for MSA 1000 with Windows 2000 Advanced Server can be performed 
while keeping your cluster online (a rolling upgrade). 

IMPORTANT: Back up all data before beginning the upgrade process. 

The basic steps to this rolling upgrade include: 

1. Back up all data. 

2. Ensure that all components are present for the second Fibre Channel loop. 

3. Move all cluster resources over to node 2. 

4. Add the redundant loop hardware. 

5. Upgrade the hardware on node 1. 

6. Install Secure Path on node 1. 

7. Move all cluster resources back to node 1. 

8. Upgrade hardware on node 2. 

9. Install Secure Path on node 2. 
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To upgrade your cluster: 

1. Back up all data. 

2. Upgrade node 1: 

a. Move all of the resource groups, including the cluster group, to node 2 using 
the Cluster Administrator Utility. 

b. Open Cluster Administrator by clicking Start, Programs, Administrative 
Tools, Cluster Administrator. 

c. Right-click Cluster Group and select Move Group. 

d. Repeat step c until all resource groups listed in the Cluster Administrator 
display have been moved to node 2. 

NOTE: For help on moving cluster groups, refer to the Microsoft clustering documentation 
that came with your operating system. 

3. Install Windows 2000 Service Pack 2 on node 1 and reboot when prompted. 

4. Install new hardware: 

a. Shut down node 1 and install the second host bus adapter, additional 
redundant MSA1000 Controllers, the additional Fibre Channel storage hub or 
Fabric switch, and cables as necessary. 

IMPORTANT: To ensure that your HA/F200 for MSA1000 cluster fails over properly, 
attach each host bus adapter in a single server to a different Fibre Channel storage hub or 
Fabric switch, and attach each MSA1000 I/O Module to a different Fibre Channel storage 
hub or Fabric switch. 

b. Power on node 1 and run the System Configuration Utility by pressing the 
F10 key when prompted during boot. 

c. Verify the configuration changes to the system (a host bus adapter was 
added). Ensure the boot order is set correctly. 

d. Save the configuration changes and reboot. 
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5. Add the Fibre Channel host bus adapter driver when the Windows 2000 Found 
New Hardware Wizard appears. 

a. Click Cancel in the Found New Hardware Wizard window. 

b. Insert the Compaq Modular SAN Array 1000 Support Software CD and 
install the drivers for Windows 2000. Reboot when prompted. 

NOTE: Ensure the HBA firmware revisions are the same. 

6. Install Secure Path on node 1: 

a. Insert the Secure Path CD to automatically start the Secure Path installation 
process or double-click the following file on the CD: 

[CD-ROM DRIVE] : \SPInstall\setup . exe 

b. Configure your clients during the installation. 

c. Remove the Secure Path CD from the CD-ROM drive. 

d. Reboot the node when prompted. 

NOTE: For more detailed information on Secure Path, refer to the Secure Path 
documentation. 

7. Upgrade the node 2 cluster by repeating steps 1 though 6 for node 2 and then 
continuing with step 8. 

8. Verify that Secure Path is running properly and that the redundant paths are 
operational by starting the Secure Path Manager Utility. 

NOTE: See the Secure Path documentation for details on running this program. 

9. Install your applications, managing and monitoring software as needed. These 
may include Compaq Insight Manager, Compaq Insight Manager XE, Compaq 
Insight Manager 7, Compaq Intelligent Cluster Administrator, and other 
applications needed on this cluster. 

The upgrade of HA/F100 for MSA1000 with Windows 2000 Advanced Server to 
HA/F200 for MSA1000 with Windows 2000 Advanced Server is complete. 



Compaq ProLiant Clusters HA/F100 and HA/F200 for MSA 1000 Administrator Guide 



4-7 



Upgrading the ProLiant Clusters HA/F100 for MSA1000 and HA/F200 for MSA 1000 



Upgrade Process II: HA/F100 for MSA1000 Windows 
Enterprise Edition to HA/F200 for MSA1000 
Windows 2000 Advanced Server 

The upgrade of HA/F100 for MSA1000 Windows Enterprise Edition to HA/F200 for 
MSA 1000 Windows 2000 Advanced Server can be performed while keeping your 
cluster online (a rolling upgrade). 

IMPORTANT: Back up all data before beginning the upgrade process. 

IMPORTANT: Prior to beginning this upgrade, both nodes running Windows Enterprise 
Edition must have Service Pack 6a for Windows Enterprise Edition installed. 

The basic steps to this rolling upgrade include: 

1. Back up all data. 

2. Ensure that all components are present for the second Fibre Channel loop. 

3. Move all cluster resources over to node 2. 

4. Remove any cluster fixes that may be installed. 

5. Run Compaq Primer Utility. 

6. Apply Service Pack 6a on node 1 again. 

7. Upgrade the operating system and drivers on node 1. 

8. Add the redundant loop hardware. 

9. Upgrade the hardware on node 1. 

10. Install Secure Path on node 1. 

11. Move all cluster resources back to node 1. 

12. Upgrade the operating system and drivers on node 2. 

13. Upgrade the hardware on node 2. 

14. Install Secure Path on node 2. 
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To upgrade your cluster: 

1. Back up all data. 

2. Upgrade node 1: 

a. Move all of the resource groups, including the cluster group, to node 2 using 
the Cluster Administrator Utility. 

b. Open Cluster Administrator by clicking Start, Programs, Administrative 
Tools, Cluster Administrator. 

c. Right-click Cluster Group and select Move Group. 

d. Repeat step c until all resource groups listed in the Cluster Administrator 
display have been moved to node 2. 

NOTE: For help on moving cluster groups, refer to the Microsoft clustering documentation 
that came with your operating system. 

3. Using Add/Remove Programs in the Control Panel, remove any cluster fixes 
that may be installed on node 1. 

4. Run the Compaq Windows 2000 Primer Utility to remove any conflicting device 
drivers or applications that might cause problems during the install or operation 
of Windows 2000 Advanced Server: 

NOTE: The Primer Utility only removes Compaq Utilities that are not compatible with 
Windows 2000. The Primer Utility can be downloaded from the following Compaq website: 

www.compaq.com/partners/microsoft 

a. Execute the Primer Utility and click Cleanup to begin the process that 
removes Windows 2000 non-supported utilities. 

b. After the Primer Utility has run, you will be asked to reboot the node to 
finish the cleanup process. 

5. Apply Service Pack 6a to node 1 and reboot when prompted. 
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6. Upgrade node 1 to Windows 2000 Advanced Server: 

a. After the node has rebooted, insert the Windows 2000 Advanced Server CD 
into the CD-ROM drive. If the Autorun feature for CDs has been turned off, 
select Start, Run. At the prompt enter 

[CD-ROM DRIVE] : \Setup.exe 

When asked if you want to upgrade Windows, click Yes. 

b. Select the Upgrade to Windows 2000 option and click Next. 

c. Select the I accept this agreement option and click Next. 

d. During the upgrade, the Microsoft setup process will automatically restart the 
computer after 15 seconds or click Finish to restart manually. The upgrade 
process is automatic and will take some time. 

e. When the upgrade is complete, log on to the node as the administrator. 

NOTE: After the Windows 2000 Advanced Server upgrade is complete the Cluster 
Service might fail to start since the DNS client for node 1 has not been set up. The 
problem that occurs is an authentication error with the user account used to start the 
Cluster Service. The error message is expected and will be resolved after the DNS client 
is set up. 

7. If DNS is not already installed, set up the DNS client on node 1 : 

a. Right-click My Network Places and click Properties. 

b. Right-click on the network connection that the clients will use to connect to 
the domain controller and click Properties. The Properties screen displays 
the current options available for the selected network card. 

c. Highlight Internet Protocol (TCP/IP) and click Properties. 

d. Select Use the following DNS server addresses. 

e. In the Preferred DNS server field, enter the Preferred DNS server IP 
address. 
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f. Click Advanced. 

g. Select the DNS tab. 

h. In the DNS server addresses, in order of use field, enter the DNS suffix. 

i. Click OK. 

8. After the client setup is complete, click Start, Shutdown. 

9. Select Log Off Administrator, then log back into the system as the 
Administrator. (This ensures that the domain controller has properly 
authenticated the machine and user ID.) 

10. Install the latest Compaq drivers and server support software by running the 
Compaq Support Paq for Windows 2000 on node 1 : 

a. Insert the SmartStart CD. Select Start, Run. In the dialog box that displays, 
enter 

[CD-ROM DRIVE] : \cpqsupsw\ntcsp\setup . exe 

NOTE: You can also acquire the latest Compaq Support Paq from the Compaq website 
at 

www.compaq.com/support 

b. After the installation is completed, the Compaq Support Paq window will 
show the drivers that were updated and the drivers that were not installed 
because of a current version of the driver on the node or because of a lack of 
hardware on the node. Reboot when prompted. 

1 1 . Install Windows 2000 Service Pack 2 on node 1 and reboot when prompted. 

12. Uninstall the Emulex LP8000 Fibre Channel host adapter driver: 

a. Click Start, Programs, Administrator Tools, Computer Management. 

b. Click Device Manager. 

c. In the right-hand pane under SCSI and Raid Controllers, uninstall the 
Emulex LP8000 Fibre Channel host adapter driver. 
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13. Install new hardware: 

a. Shut down node 1 and install the second host bus adapter, additional 
redundant MSA1000 Controllers, the additional Fibre Channel storage hub or 
Fabric switch, and cables as necessary. 

IMPORTANT: To ensure that your HA/F200 for MSA1000 cluster fails over properly, 
attach each host bus adapter in a single server to a different Fibre Channel storage hub or 
Fabric switch, and attach each MSA1000 Controller in the MSA1000 to a different Fibre 
Channel storage hub or Fabric switch. 

b. Turn on node 1 and run the System Configuration Utility by pressing the F10 
key when prompted during boot. 

c. Verify the configuration changes to the system (a host bus adapter was 
added). Ensure the boot order is set correctly. 

d. Save configuration changes and reboot. 

14. Add the Fibre Channel host bus adapter driver when the Windows 2000 Found 
New Hardware Wizard appears: 

a. Click Cancel in the Found New Hardware Wizard window. 

b. Insert the Compaq Modular SAN Array 1000 Support Software CD and 
install the HBA drivers. 

c. Reboot as requested. 

15. Install Secure Path on node 1: 

a. Insert the Secure Path CD to automatically start the Secure Path installation 
process or double-click the following file on the CD: 

[CD-ROM DRIVE] : \SPInstall\setup . exe 

b. Configure your clients during the installation. 

c. Remove the Secure Path CD from the CD-ROM drive. 

d. Reboot the node when prompted. 

NOTE: For more detailed information on Secure Path, refer to the Secure Path 
documentation. 

16. Reapply Windows 2000 Service Pack 2; reboot when prompted. 
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17. Upgrade node 2 in a cluster by repeating steps 1 through 16 for node 2 and then 
continuing with step 18. 

18. Verify that Secure Path is running properly and that the redundant paths are 
operational by starting the Secure Path Manager Utility. 

NOTE: See the Secure Path documentation for details on running this program. 

19. Install your applications, managing and monitoring software as needed. These 
may include Compaq Insight Manager, Compaq Insight Manager XE, Compaq 
Insight Manager 7, Compaq Intelligent Cluster Administrator, and other 
applications needed on this cluster. 

The upgrade of HA/F100 for MSA1000 Windows Enterprise Edition to HA/F200 for 
MSA 1000 Windows 2000 Advanced Server is complete. 

Upgrade Process III: HA/F100 for MSA1000 Windows 
Enterprise Edition to HA/F200 for MSA1000 Windows 
Enterprise Edition 

The upgrade of HA/F100 for MSA1000 Windows Enterprise Edition to HA/F200 for 
MSA 1000 Windows Enterprise Edition can be performed while keeping your cluster 
online (a rolling upgrade). 

IMPORTANT: Back up all data before beginning the upgrade process. 

IMPORTANT: Prior to beginning this upgrade, both nodes running Windows Enterprise 
Edition must have Service Pack 6a for Windows Enterprise Edition installed. 

The basic steps to this rolling upgrade include: 

1. Back up all data. 

2. Ensure that all components are present for the second Fibre Channel loop. 

3. Move all cluster resources over to node 2. 

4. Add the redundant loop hardware. 

5. Upgrade the hardware on node 1. 
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6. Install Secure Path on node 1. 

7. Move all cluster resources back to node 1. 

8. Upgrade hardware on node 2. 

9. Install Secure Path on node 2. 

To upgrade your cluster: 

1. Back up all data. 

2. Upgrade node 1: 

a. Move all of the resource groups, including the cluster group, to node 2 using 
the Cluster Administrator Utility. 

b. Open Cluster Administrator by clicking Start, Programs, Administrative 
Tools, Cluster Administrator. 

c. Right-click Cluster Group and select Move Group. 

d. Repeat step c until all resource groups listed in the Cluster Administrator 
display have been moved to node 2. 

NOTE: For help on moving cluster groups, refer to the Microsoft clustering documentation 
that came with your operating system. 

3. Install Windows NT Service Pack 6a on node 1 and reboot when prompted. 

4. Install new hardware: 

a. Shut down node 1 and install the second host bus adapter, additional 
redundant MSA1000 Controllers, the additional Fibre Channel storage hub or 
Fabric switch, and cables as necessary. 

IMPORTANT: To ensure that your HA/F200 for MSA1000 cluster fails over properly, 
attach each host bus adapter in a single server to a different Fibre Channel storage hub or 
Fabric switch, and attach each MSA1000 I/O Module to a different Fibre Channel storage 
hub or Fabric switch. 

b. Power on node 1 and run the System Configuration Utility by pressing the 
F10 key when prompted during boot. 
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c. Verify the configuration changes to the system (a host bus adapter was 
added). Ensure the boot order is set correctly. 

d. Save the configuration changes and reboot. 

5. Insert the Modular SAN Array 1000 Support Software CD and install the drivers 
for Windows NT. Reboot when prompted. 

NOTE: Ensure the HBA firmware revisions are the same. The utility to update the HBA 
firmware is located on the Compaq Modular SAN Array Support Software CD. For more 
information, refer to the HBA documentation. 

6. Install Secure Path on node 1: 

a. Insert the Secure Path CD to automatically start the Secure Path installation 
process or double-click the following file on the CD: 

[CD-ROM DRIVE] : \SPInstall\setup . exe 

b. Configure your clients during the installation. 

c. Remove the Secure Path CD from the CD-ROM drive. 

d. Reboot the node when prompted. 

NOTE: For more detailed information on Secure Path, refer to the Secure Path 
documentation. 

7. Upgrade node 2 by repeating steps 1 though 6 for node 2 and then continuing 
with step 8. 

8. Verify that Secure Path is running properly and that the redundant paths are 
operational by starting the Secure Path Manager Utility. 

NOTE: See the Secure Path documentation for details on running this program. 

9. Install your applications, managing and monitoring software as needed. These 
may include Compaq Insight Manager, Compaq Insight Manager XE, Compaq 
Insight Manager 7, Compaq Intelligent Cluster Administrator, and other 
applications needed on this cluster. 

The upgrade of HA/F100 for MSA1000 Windows Enterprise Edition to HA/F200 for 
MSA 1000 Windows Enterprise Edition is complete. 
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5 



Migrating the Operating System of the 
HA/F100 for MSA1000 and HA/F200 for 

MSA1000 



This chapter details the procedures for the following cluster migrations: 

• HA/F100 for MSA1000 Windows Enterprise Edition to HA/F100 for MSA1000 
Windows 2000 Advanced Server 

• HA/F200 for MS A 1 000 Windows Enterprise Edition to HA/F200 for MS A 1 000 
Windows 2000 Advanced Server 

This chapter assumes you are familiar with basic cluster management and operation. 
It also assumes that you are familiar with the hardware and software configuration 
details outlined in Chapter 3 of this guide. 

Even though some of the procedures in this chapter detail how to migrate clusters by 
loading one operating system over another, Compaq strongly recommends that you 
migrate the cluster servers from Windows Enterprise Edition to Windows 2000 
Advanced Server using a new installation of Windows 2000 Advanced Server. Using 
a new installation of Windows 2000 Advanced Server clears all the data from the 
server's hard disks and eliminates any installation problems from loading Windows 
2000 Advanced Server on top of Windows Enterprise Edition. 
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All of the procedures outlined in this chapter preserve the data on the cluster's 
external storage systems. On a case-by-case basis, some of the procedures allow the 
cluster to be upgraded while continuing to service clients. These upgrades are called 
"rolling upgrades." During these rolling upgrades, however, the cluster will be 
operating in a degraded state since one of the servers will be unavailable to clients. 
Other installations will require a new installation of the operating system without 
giving an option to load Windows 2000 Advanced Server over Windows Enterprise 
Edition. Read any migration procedure thoroughly before starting it. Some of these 
procedures require all nodes of the cluster to be turned off at the same time, causing 
the cluster to be unavailable to clients during the upgrade. 

IMPORTANT: These procedures may be updated over time. For additional information on 
migrating to Windows 2000 Advanced Server visit the Microsoft website 
(www.microsoft.com) or the Compaq High Availability website at 
(www.compaq.com/highavailability). 

NOTE: These steps assume that you currently have an existing cluster in an 
operational state. 



Pre-migration Preparation 

The following items are needed for any cluster migration: 

• Compaq SmartStart and Support Software 

• Server Profile Diskette (included with SmartStart) 

• Compaq Modular SAN Array 1000 Support Software CD 

• One of the following operating systems and the documentation received with the 
operating system: 

— Microsoft Windows 2000 Advanced Server 

— Microsoft Windows NT Server 4.0, Enterprise Edition 

• Microsoft Service Packs 
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Ensure the following documentation is available for reference: 

• Compaq SmartStart for Servers Setup poster 

• Microsoft clustering documentation 

• Compaq ProLiant servers documentation 

• Compaq shared external storage documentation 

• Compaq host bus adapter documentation 

• Fibre Channel storage hub or Fabric switch documentation 

— Compaq StorageWorks Fibre Channel Storage Hub 7 Installation Guide 

— Compaq StorageWorks Fibre Channel SAN Switch 8 Installation and 
Hardware Guide 

— Compaq StorageWorks Fibre Channel SAN Switch 16 Installation and 
Hardware Guide 

— Compaq StorageWorks Fibre Channel SAN Switch 8-EL Installation and 
Hardware Guide 

— Compaq StorageWorks Fibre Channel SAN Switch 16-EL Installation and 
Hardware Guide 

— Compaq StorageWorks Modular SAN Array Fabric Switch 6 User Guide 

• Monitoring and management software 

— Compaq Insight Manager 

— Compaq Insight Manager XE 

— Compaq Insight Manager 7 

— Compaq Intelligent Cluster Administrator 

NOTE: See Appendix A to determine which Service Packs and software and firmware version 
levels your cluster requires. 
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Migration Process I: HA/F100 for MSA1000 with 
Windows Enterprise Edition to HA/F100 for MSA1000 
with Windows 2000 Advanced Server 

The migration of HA/F100 for MSA 1000 with Windows Enterprise Edition to 
HA/F100 for MSA1000 with Windows 2000 Advanced Server can be performed 
while keeping your cluster online (a rolling upgrade). 

IMPORTANT: Back up all data before beginning the migration process. 

IMPORTANT: Prior to beginning this migration, both nodes running Windows Enterprise 
Edition must have Service Pack 6a for Windows Enterprise Edition installed. 

The basic steps to this migration include: 

1. Back up all data. 

2. Move all cluster resources over to node 2. 

3. Remove any cluster fixes that may be installed. 

4. Run Compaq Primer Utility. 

5. Apply Service Pack 6a on node 1 again. 

6. Upgrade the operating system and drivers on node 1. 

7. Move all cluster resources back to node 1. 

8. Upgrade the operating system and drivers on node 2. 

To migrate your cluster: 

1. Back up all data. 

2. Migrate node 1: 

a. Move all of the resource groups, including the cluster group, to node 2 using 
the Cluster Administrator Utility. 

b. Open Cluster Administrator by clicking Start, Programs, Administrative 
Tools, Cluster Administrator. 
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c. Right-click Cluster Group and select Move Group. 

d. Repeat step c until all resource groups listed in the Cluster Administrator 
display have been moved to node 2. 

NOTE: For help on moving cluster groups, refer to the Microsoft clustering documentation 
that came with your operating system. 

3. Using Add/Remove Programs in the Control Panel, remove any cluster fixes 
that may be installed on node 1 . 

4. Run the Compaq Windows 2000 Primer Utility to remove any conflicting device 
drivers or applications that might cause problems during the install or operation 
of Windows 2000 Advanced Server: 

NOTE: The Primer Utility only removes Compaq Utilities that are not compatible with 
Windows 2000. The Primer Utility can be downloaded from the following Compaq website: 

www.compaq.com/partners/microsoft 

a. Execute the Primer Utility and click Cleanup to begin the process that 
removes Windows 2000 non-supported utilities. 

b. After the Primer Utility has run, you will be asked to reboot the node to 
finish the cleanup process. 

5. Apply Service Pack 6a to node 1 and reboot when prompted. 

6. Upgrade node 1 to Windows 2000 Advanced Server: 

a. After the node has been rebooted, insert the Windows 2000 Advanced Server 
CD into the CD-ROM drive. If the Autorun feature for CDs has been turned 
off, click Start, Run. At the prompt enter 

[CD-ROM DRIVE] : \Setup.exe 

b. When asked if you want to upgrade Windows, click Yes. 

c. Select the Upgrade to Windows 2000 option and click Next. 

d. Select the I accept this agreement option and click Next. 
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e. During the upgrade, the Microsoft setup process will automatically restart the 
computer after 15 seconds or click Finish to restart manually. The upgrade 
process is automatic and will take some time. 

f. When the upgrade is complete, log on to the node as the administrator. 

7. Install Windows 2000 Advanced Server Service Pack 2 on node 1 and reboot the 
system when prompted. 

8. Uninstall the Emulex LP8000 Fibre Channel host adapter driver: 

a. Click Start, Programs, Administrator Tools, Computer Management. 

b. Click Device Manager. 

c. In the right-hand pane under SCSI and Raid Controllers, uninstall the 
Emulex LP8000 Fibre Channel host adapter driver. 

9. Install the latest Compaq drivers and server support software by running the 
Compaq Support Paq for Windows 2000 on node 1 : 

a. Insert the SmartStart CD and click Start, Run. 

b. In the displayed dialog box, enter 

[CD-ROM DRIVE] : \cpqsupsw\ntcsp\setup . exe 

NOTE: You can also acquire the latest Compaq Support Paq from the Compaq website: 
www.compaq.com/support. 

c. After the installation is completed, the Compaq Support Paq window will 
show the drivers that were updated and the drivers that were not installed 
because a current version of the driver exists on the node or because a lack of 
hardware on the node. 

10. Insert the Compaq Storage Works Modular SAN Array 1000 Support Software 
CD: 

a. Install the HBA drivers. 

b. Reboot as requested. 
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11. If DNS is not already installed, set up the DNS client on node 1: 

a. Right-click My Network Places and click Properties. 

b. Right-click the network connection that the clients will use to connect to the 
domain controller and click Properties. 

c. The Properties screen displays the current options available for the selected 
network card. Highlight Internet Protocol (TCP/IP), and click Properties. 

d. Select Use the following DNS server addresses. 

e. In the Preferred DNS server addresses field, enter the Preferred DNS 
server IP address. 

f. Click Advanced. 

g. Select the DNS tab. 

h. In the DNS server addresses, in order of use field, enter the DNS suffix. 

i. Click OK. 

j. After the client setup is complete, click Start, Shutdown. 

k. From the shutdown screen, select Log Off Administrator, then log back into 
the system as the administrator. (This ensures that the domain controller has 
properly authenticated the machine and user ID.) 
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12. Verify that the Cluster Service has started and that node 1 has rejoined the 
cluster: 

a. Open Cluster Administrator by clicking Start, Programs, Administrative 
Tools, Cluster Administrator. 

b. As the Cluster Administrator opens, an error may display. This error is a 
result of the additional resources that Windows 2000 Advanced Server offers 
Microsoft Cluster Server. This error will not display when node 2 is 
upgraded to Windows 2000 Advanced Server. 

c. From the error screen, select Yes To All to open Cluster Administrator. 

13. Upgrade node 2 by repeating steps 1 through 12 for node 2 and then continuing 
with step 14. 

14. Install your applications, managing and monitoring software as needed. These 
may include Compaq Insight Manager, Compaq Insight Manager XE, Compaq 
Insight Manager 7, Compaq Intelligent Cluster Administrator, and other 
applications needed on this cluster. 

The migration of HA/F100 for MSA 1000 with Windows Enterprise Edition to 
HA/F100 for MSA1000 with Windows 2000 Advanced Server is complete. 

Migration Process II: HA/F200 for MSA1000 with 
Windows Enterprise Edition to HA/F200 for MSA1000 
with Windows 2000 Advanced Server 

The migration of HA/F200 for MSA 1000 with Windows Enterprise Edition to 
HA/F200 for MSA1000 with Windows 2000 Advanced Server can be performed 
while keeping your cluster online (a rolling upgrade). 

IMPORTANT: Back up all data before beginning the migration process. 

IMPORTANT: Prior to beginning this migration, both nodes running Windows Enterprise 
Edition must have Service Pack 6a for Windows Enterprise Edition installed. 

NOTE: Service Pack 6a must also be reinstalled after removing Secure Path Manager and 
before beginning the Windows 2000 upgrade. 
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The basic steps to this migration include: 

1. Back up all data. 

2. Move all cluster resources over to node 2. 

3. Uninstall any cluster fixes that may be installed. 

4. Run Compaq Primer Utility. 

5. Uninstall Secure Path from node 1. 

6. Apply Service Pack 6a on node 1 again. 

7. Upgrade the operating system and drivers on node 1. 

8. Install Secure Path on node 1. 

9. Move all cluster resources back to node 1. 

10. Upgrade the operating system and drivers on node 2. 

1 1 . Install Secure Path on node 2. 

To migrate your cluster: 

1. Back up all data. 

2. Migrate node 1: 

a. Move all of the resource groups, including the cluster group, to node 2 using 
the Cluster Administrator Utility. 

b. Open Cluster Administrator by clicking Start, Programs, Administrative 
Tools, Cluster Administrator. 

c. Right-click Cluster Group and click Move Group. 

d. Repeat step c until all resource groups listed in the Cluster Administrator 
display have been moved to node 2. 

NOTE: For help on moving cluster groups, refer to the Microsoft clustering documentation 
that came with your operating system. 

3. Using Add/Remove Programs in the Control Panel, remove any cluster fixes 
that might be installed on node 1 . 
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4. Run the Compaq Windows 2000 Primer Utility to remove any conflicting device 
drivers or applications that might cause problems during the install or operation 
of Windows 2000 Advanced Server: 

NOTE: The Primer Utility only removes Compaq Utilities that are not compatible with 
Windows 2000. The Primer Utility can be downloaded from the following Compaq website: 

www.compaq.com/partners/microsoft 

a. Execute the Primer Utility and click Cleanup to begin the process that 
removes Windows 2000 non-supported utilities. 

b. After the Primer Utility has run, you will be asked to reboot the node to 
finish the cleanup process. 

5. Using Add/Remove Programs in the Control Panel, remove Secure Path from 
node 1. 

6. Apply Service Pack 6a to node 1 and reboot when prompted. 

7. Upgrade node 1 to Windows 2000 Advanced Server: 

a. After the node has been rebooted, insert the Windows 2000 Advanced Server 
CD into the CD-ROM drive. If the Autorun feature for CDs has been turned 
off, click Start, Run. At the prompt, enter 

[CD-ROM DRIVE] : \Setup.exe 

b. When asked if you want to upgrade Windows, select Yes. 

c. Select the Upgrade to Windows 2000 option and click Next. 

d. Select the I accept this agreement option and click Next. 

e. During the upgrade, the Microsoft setup process will automatically restart the 
computer after 15 seconds or click Finish to restart manually. The upgrade 
process is automatic and will take some time. 

f. When the upgrade is complete, log on to the node as the administrator. 

8. Install Windows 2000 Advanced Server Service Pack 2 on node 1 and reboot 
when prompted. 
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9. Install the latest Compaq drivers and server support software by running the 
Compaq Support Paq for Windows 2000 on node 1 : 

a. Insert the SmartStart CD and click Start, Run. 

b. In the displayed dialog box, enter 

[CD-ROM DRIVE] : \cpqsupsw\ntcsp\setup . exe 

NOTE: You can also acquire the latest Compaq Support Paq from the Compaq website: 
www.compaq.com/support 

After the installation is completed, the Compaq Support Paq window will 
show the drivers that were updated and the drivers that were not installed 
because a current version of the driver exists on the node or because a lack of 
hardware on the node. 

NOTE: Allow time for Windows 2000 to find any new devices. If new devices are 
detected, Windows will prompt you with a dialog box to reboot; click Yes and reboot. This 
is necessary for a successful migration. 

NOTE: After the Windows 2000 Advanced Server upgrade is complete the Cluster 
Service may fail to start since the DNS client for node 1 has not been set up. The problem 
that occurs is an authentication error with the user account used to start the Cluster 
Service. The error message is expected and will be resolved after the DNS client is set 
up. 

10. If DNS is not already installed, set up the DNS client on node 1: 

a. Right-click My Network Places and click Properties. 

b. Right-click the network connection that the clients will use to connect to the 
domain controller and click Properties. 

c. The Properties screen displays the current options available for the selected 
network card. Highlight Internet Protocol (TCP/IP), and click Properties. 

d. Select Use the following DNS server addresses. 

e. In the Preferred DNS server field, enter the Preferred DNS server IP 
address. 

f. Click Advanced. 

g. Select the DNS tab. 
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h. In the DNS server addresses, in order of use field, enter the DNS suffix. 

i. Click OK. 

j. After the client setup is complete, click Start, Shutdown. 

k. From the shutdown screen, select Log Off Administrator and log back into 
the system. As the administrator. (This ensures that the domain controller has 
properly authenticated the machine and user ID.) 

1 1 . Uninstall the Emulex LP8000 Fibre Channel host adapter driver: 

a. Click Start, Programs, Administrator Tools, Computer Management. 

b. Within Computer Management, click Device Manager. 

c. In the right-hand pane under SCSI and Raid Controllers, uninstall the 
Emulex LP8000 Fibre Channel host adapter driver. 

12. Insert the Compaq StorageWorks Modular SAN Array 1000 Support Software 
CD: 

a. Install the HBA drivers. 

b. Reboot as requested. 

13. Install Secure Path on node 1: 

a. Insert the Secure Path CD to automatically start the Secure Path installation 
process or double-click the following file on the CD: 

[CD-ROM DRIVE] : \SPInstall\setup . exe 

b. Configure your clients during the installation. 

c. Remove the Secure Path CD from the CD-ROM drive. 

d. Reboot the node when prompted. 

NOTE: For more detailed information on Secure Path, refer to the Secure Path 
documentation. 
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14. Verify that the Cluster Service has started and that node 1 has rejoined the 
cluster: 

a. Open Cluster Administrator by clicking Start, Programs, Administrative 
Tools, Cluster Administrator. 

b. As the Cluster Administrator opens, an error might display. This error is a 
result of the additional resources that Windows 2000 Advanced Server offers 
Microsoft Cluster Server. This error will not display when node 2 is 
upgraded to Windows 2000 Advanced Server. 

c. From the error screen, click Yes To All to open Cluster Administrator. 

15. Upgrade node 2 by repeating steps 1 through 14 for node 2 and then continuing 
with step 16. 

16. Verify that Secure Path is running properly and that the redundant paths are 
operational by starting the Secure Path Manager Utility. 

NOTE: See the Secure Path documentation for details on running this program. 

17. Install your applications, managing and monitoring software as needed. These 
may include Compaq Insight Manager, Compaq Insight Manager XE, Compaq 
Insight Manager 7, Compaq Intelligent Cluster Administrator, and other 
applications needed on this cluster. 

The migration of HA/F200 for MSA 1000 with Windows Enterprise Edition to 
HA/F200 for MSA1000 with Windows 2000 Advanced Server is complete. 
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Managing the Compaq ProLiant Clusters 
HA/F100 and HA/F200 for MSA1000 



Throughout the life of a cluster, there will be the need to improve the performance, 
upgrade hardware components, upgrade software, increase storage capacity, 
restructure cluster groups, back up cluster data, and monitor ongoing activities. This 
chapter describes these management concepts as they pertain to clusters. The chapter 
also details the utilities and programs used in the ongoing management of Compaq 
ProLiant Clusters HA/F100 and HA/F200 for MSA1000. The topics addressed in this 
chapter include: 

• Modifying Physical Cluster Resources 

• Backing up your Cluster 

• Managing Cluster Performance 

• Using Secure Path Manager 
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Modifying Physical Cluster Resources 
Removing a Shared Storage System 

Each cluster node, and the cluster as a whole, depends on the shared storage for data, 
log, and, possibly, application program files. If the shared storage is removed, all 
clustered applications that are dependent on that shared storage system will be 
offline. Network clients will not have access to the clustered applications. 

NOTE: These procedures for removing a shared storage system assume that you are making 
modifications to a previously configured HA/F100 or HA/F200 for MSA1000. 

To remove a shared storage from your cluster: 

1 . Use Cluster Administrator to bring offline all cluster groups and cluster resources 
including physical disk resources dependent on the MSA 1000 storage system you 
are about to remove from the cluster. 

2. Delete the cluster groups and resources you brought offline in step 1. 

3. Remove appropriate LUNs using ACU. 

4. Power off the MSA1000 storage system you are about to remove. Remove the 
Gigabit Interface Converter (GBIC) and the cable from the Fibre Channel storage 
hub or Fabric switch. 

Adding a Shared Storage System 

A Compaq StorageWorks Modular SAN Array 1000 cannot be dynamically added to 
the cluster. 

NOTE: These procedures for adding a shared storage system assume that you are making 
modifications to a previously configured HA/F100 or HA/F200 for MSA1000 cluster. 

To add another shared storage system to an existing cluster: 
1 . Power down one of the cluster servers (node 2). 
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2. Insert the Compaq Modular SAN Array Support Software CD into the CD-ROM 
drive of the other cluster server (node 1). Power down node 1. 

3. Insert all of the SCSI drives into the MSA 1000 storage system. 

IMPORTANT: If using a Compaq ProLiant Cluster HA/F200 for MSA1000 with redundant 
paths, be sure to attach both array controllers, one to each Fibre Channel storage hub or 
Fabric switch. 

4. Connect the additional MSA1000 storage system to the Fibre Channel storage 
hub or Fabric switch. Follow the hardware installation steps detailed in Chapter 3 
of this guide. 

5. Power up the newly added MSA 1000 storage system. 

6. Power up node 1 and run the ACU XE. Create a new drive. Refer to the Compaq 
StorageWorks Modular SAN Array 1000 User Guide for more information about 
the ACU. 

IMPORTANT: Be sure to understand your cluster groupings for applications and resources 
whose data, log, and, possibly, program files will reside on these drives. 

7. Save the configuration changes and exit ACU XE. Remove the Compaq Modular 
SAN Array Support Software CD. 

8. Boot node 1 to Windows Enterprise Edition or Windows 2000 Advanced Server, 
then run Disk Administrator for Windows Enterprise Edition or Disk 
Management for Windows 2000 Advanced Server to assign permanent letters to 
the drive volumes and configure the newly added disks for use in a cluster. Refer 
to your Microsoft documentation for more details. 

9. Open Cluster Administrator. The newly created disk is now ready to participate 
in the cluster. Follow the MSCS documentation for details about adding new 
cluster resources. 

IMPORTANT: Do not run the Array Configuration Utility on the second cluster node. The 
information entered from the first cluster node is maintained on the drives; therefore, the drives 
are already configured for the second node. 

10. Power up node 2 and log on to Windows Enterprise Edition or Windows 2000 
Advanced Server. Refer to the MSCS documentation for procedures to verify that 
the newly created cluster resources are operational on node 2. 
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Adding or Removing Shared Storage Drives 

There are two situations in which you might add or remove drives from the shared 
storage: 

NOTE: These procedures for adding or removing shared storage drives assume that you are 
making modifications to a previously configured HA/F100 or HA/F200 for MSA1000 cluster. 

• When one of the drives fails. In this case, you are simply exchanging the failed 
drive for a new one. 

• When you want to add capacity to your MSA 1000 storage system. 

In both situations, it is assumed that you are employing RAID 1+0, 5, or ADG for all 
of the devices in the storage unit. 



A CAUTION: To add or remove drives from the MSA1000 storage system, read the 
documentation that came with your storage system. Failure to follow the steps 
outlined in the storage system documentation may result in loss of data. 



Replacing a Failed Drive 

The procedure for replacing a failed drive is completed within the MSA 1000 storage 
system. Neither MSCS nor the Microsoft Windows operating system is aware of the 
change, and operation of both continues without interruption. 

IMPORTANT: If the failed drive was not configured to use RAID 1+0, 5, or ADG (that is, your 
drives have no fault tolerance), you may lose some or all of the data on the failed drive. 

Refer to the documentation that came with your storage system for instructions on 
replacing a failed drive. 
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Adding Drives to Increase Storage Capacity 

The following steps describe how to add a drive to the Compaq MSA 1000 storage 
system and to allocate it to MSCS. 

NOTE: These procedures for adding drives to increase storage capacity assume that you are 
making modifications to a previously configured HA/F100 or HA/F200 for MSA1000 cluster. 

1 . Power down one of the cluster servers (node 2). 

2. Insert the Compaq Modular SAN Array Support Software CD into the other 
cluster server (node 1). Power down node 1. 

3. Insert new drives in the MSA 1000 storage array. 

IMPORTANT: If using a Compaq ProLiant Cluster HA/F200 for MSA1000 with redundant 
paths, be sure to attach both array controllers, one to each Fibre Channel storage hub or 
Fabric switch. 

4. Power up the node 1 and run the ACU XE. Refer to Compaq StorageWorks 
Modular SAN Array 1000 User Guide for more information about creating a new 
logical drive from free space on an existing array, an expanded array, or a new 
array. 

IMPORTANT: Be sure to understand your cluster groupings for applications and resources, 
whose data, log, and possibly program files will reside on these drives. 

5. Save the configuration changes and exit ACU XE. Remove the Compaq Modular 
SAN Array Support Software CD. 

6. Boot node 1 to Windows Enterprise Edition or Windows 2000 Advanced Server, 
then run Disk Administrator for Windows Enterprise Edition or Disk 
Management for Windows 2000 Advanced Server to assign permanent letters to 
the drive volumes and configure the newly added disks for use in a cluster. Refer 
to the MSCS documentation for more details. 
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7. Open Cluster Administrator. The newly created disk is now ready to participate 
in the cluster. Follow the MSCS documentation for details about adding new 
cluster resources. 

IMPORTANT: Do not run the ACU on the second cluster node. The information entered from 
the first cluster node is maintained on the drives; therefore, the drives are already configured 
for the second node. 

8. Power up node 2 and log on to Windows Enterprise Edition or Windows 2000 
Advanced Server. Refer to the MSCS documentation for procedures to verify that 
the newly created cluster resources are operational on node 2. 

NOTE: You cannot increase the capacity of an existing Windows Enterprise Edition or 
Windows 2000 Advanced Server drive volume, but you can assign a new drive letter to the 
extra capacity furnished by the added drives. 



Physically Replacing a Cluster Node 

To simplify integration of the new cluster node, retain the Windows Enterprise 
Edition or Windows 2000 Advanced Server boot drives from the node being replaced 
for use in the replacement node. If this is not possible, perform the steps described in 
the Chapter 3 "Assisted Integration Using SmartStart (Recommended)" section for 
adding a second node. 

NOTE: These procedures for physically replacing a cluster node assume that you are making 
modifications to a previously configured HA/F100 or HA/F200 for MSA1000 cluster. 



Replacing a Cluster Node 

To replace an existing cluster node: 

1 . Ensure all cluster groups are running on the remaining cluster node (node 1 , for 
example). Fail over to the remaining node any cluster groups that are running on 
the node being replaced (node 2). 

2. Open Cluster Administrator on node 1. Right-click Node 2. Select Evict Node to 
remove node 2 from the cluster. 

3. After node 2 is removed from the cluster, shut down node 2 and power it off. 
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4. Remove the interconnect, LAN, Fibre Channel cables, and power cables. 
Remove any other cables that are attached to the node. If the node is in a rack, 
remove it from the rack. 

5. Remove all hardware devices that will be used in the new node from the node 
being replaced and place them in the new node. Install all other hardware devices 
into the new node. 

6. If the new node is part of a rack system, place the server in the rack. Attach the 
interconnect, LAN, Fibre Channel cables, and power cables. 

If you are recreating the new node, follow the installations steps outlined in 
Chapter 3. 

Installing a New Windows Enterprise Edition or Windows 2000 
Advanced Server Boot Drive 

New Windows Enterprise Edition or Windows 2000 Advanced Server boot drives 
require installation of Windows Enterprise Edition or Windows 2000 Advanced 
Server, configuration of the networking components of the new node, and installation 
of MSCS. Follow the SmartStart Assisted Path installation steps for node 2 outlined 
in Chapter 3. 

NOTE: When installing MSCS, be sure to join the existing cluster. 



Backing Up Your Cluster 

The need for cluster backup is essential for both business-critical and nonbusiness- 
critical data. As data becomes an increasingly valued company asset, it needs to be 
backed up on a regular basis. The process of backing up data ensures that a 
company's assets are secure and available when a disaster strikes. The cluster itself 
provides a high degree of application availability but does not prevent a user from 
deleting or corrupting a file or set of files. Backing up a cluster's data increases the 
overall level of data availability. 
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There are several methods for backing up clustered systems, including: 

• Both cluster nodes attached to a single tape backup device, such as the 
Fibre Channel-based Compaq Storage Works Enterprise Backup Solution. 

• Two-tape backup subsystems where one tape backup subsystem is attached to 
each cluster node. 

• A separate backup server that uses the standard public client LAN as the 
backup path. 

Tape backup software varies in its level of cluster-aware integration and operation. 
Contact your tape backup software vendor for more details regarding their specific 
cluster implementation and support. 

Managing Cluster Performance 

As applications or resources transfer from one server to another, the performance of 
the clustered servers will likely change. The extent of the change depends on how 
well equipped the other node is to handle the increased workload. This change in 
performance is especially obvious after a server failure, where all of the cluster 
resources may have to move to the other server. 

Use the Windows Enterprise Edition or Windows 2000 Advanced Server 
performance monitor utility to observe and track system performance. Some 
applications have their own internal performance measurement capabilities. Constant 
monitoring of each system's performance in a cluster may identify a disparity in the 
workload being performed by each server. Adjust the distribution of the cluster 
applications so that the overall cluster performance is maximized. 

Dynamic load balancing is not available in MSCS. However, MSCS can statically 
load balance your cluster nodes. Information obtained from the performance monitor 
utility can be used to determine whether either of the cluster nodes is operating at too 
high a performance level. Then you can use Cluster Administrator to fail over as 
many cluster groups as necessary to balance the load of each cluster node. 
Additionally, disk I/O can be balanced across the dual host bus adapters in a Compaq 
ProLiant Cluster HA/F200 for MSA 1000 configuration. This functionality is only 
available when accessing two separate RAID arrays. 
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The timing of static load balancing depends on the type of group to be moved and 
how many clients are using the group. File and print services are normally not 
business-critical, nor do they sustain an extremely high utilization rate. For these 
reasons, file and print services are good candidates to move whenever static load 
balancing needs to occur, even when the overall cluster utilization is high. 

However, some applications, such as databases, should not be moved from one server 
to another during peak processing periods. To move a database from one server to 
another, the database must be shut down and restarted. Users do not have access to 
the database during the time it takes the database to restart. Move a database group to 
another server during nonpeak hours. 

Using Secure Path Manager 

Secure Path provides continuous data access for RAID storage systems on 
Windows 2000 Advanced Server. It allows an MSA 1000 storage system to be cabled 
on two independent busses, using two separate host bus adapters in a single server. If 
a failure on one path's host bus adapter, cable, or controller occur, the failure is 
detected and I/O is automatically rerouted to the functioning, alternate path. This 
process is called failover, and requires no resource downtime and ensures high 
availability of data. Failed-over storage units may be failed back, through use of a 
configuration management utility, after failed components have been replaced. 

Launching Secure Path Manager 

To launch Secure Path Manager (SPM): 

1 . Click Start, Programs, SecurePath, and then the SPM submenu. 

2. Click the SPM application icon. 

Logging on to Secure Path Manager 

Logging on to SPM incorporates entering user and storage profiles definitions 
directly from the login window. 
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Defining SPM Storage Profiles 

SPM displays a storage-centric view of Secure Path-managed RAID storage 
resources. All Secure Path-protected RAID storage systems common to a given host 
(or set of hosts) are presented in an SPM display. 

During SPM login, enter hosts that share these RAID storage systems while defining 
storage profiles from the login window. 

• To create a non-clustered host profile, start by entering a host name (or set of 
host names) in the HostlCluster names field. 

• To create a clustered host profile, enter a host name (or set of host names) with 
each followed by a \your clustername designation to identify cluster membership. 

A single instance of SPM is capable of managing: 

• Multiple non-clustered hosts sharing one or more RAID storage systems 

• A single set of clustered hosts sharing one or more RAID storage systems 

More than one instance of SPM is required to manage installations that include a mix 
of non-clustered and clustered hosts. 

After you have added all the host names to your storage profile, enter the connection 
password in the Password field. This is the password that you defined for the Secure 
Path Agent during setup, or when you run the Secure Path Agent Configuration 
utility after installation. 

SPM uses this password to establish a network connection with the Secure Path 
hosts. For storage profiles including more than one host, the connection password 
must be the same on each Secure Path host. Click the Save Password check box if 
you want SPM to use the saved password automatically each time you log in with 
this storage profile. 
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Saving an SPM Storage Profile 

To save an SPM profile: 

1 . Enter a unique name in the Profile field after you define a storage profile. 

2. Save the profile by clicking Save Profile. 

Creating a New SPM Storage Profile 

To create additional SPM storage profiles: 

1. Click New. 

2. Add host name in the HostlCluster Names field. 

3. Enter a profile name in the Profile field. 

4. Click Save Profile. 

Selecting an Existing SPM Storage Profile 

To choose an existing SPM storage profile, use the pull-down arrow on the Profile 
box to find and select the profile. 

If you did not choose to save the password when you originally created the profile, 
enter the password in the Password field and click Login. 

Editing an Existing SPM Storage Profile 

To edit an existing storage profile, select the profile to be edited. Make the desired 
changes to the profile and click Save Profile. 
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Changing the Secure Path Agent Password 

To change the Secure Path Agent password: 

1 . Run the Secure Path Agent Configuration Utility located in the Secure Path 
program folder from the Start Menu. 

2. After you change the Agent's client (SPM) access list or password using the 
Configuration Utility you must stop and restart the Agent using the Windows 
Services Applet located in Control Panel. 

3. Find and select the Secure Path Agent in the list of services and click Stop. 

4. After the Agent has stopped, select Secure Path Agent again and click Start. 

The Agent restarts and updates its client and/or password database. Make sure that 
you perform this procedure for each of the hosts in an SPM storage profile. 

Managing Storagesets and Paths in a Clustered Environment 

You can perform the following actions on the storagesets and paths managed 
by SPM: 

• Move a storageset from one controller to the other 

• Verify a path 

Moving a Storageset from One Controller to the Other 

Choose Move a Storageset when you want to change the ownership from the current 
RAID Array controller to another one. This action is useful if you need to manually 
return a failed-over storageset to its Preferred path when Auto-Failback has been 
disabled. 

To move a storageset: 

1 . Click the drive to highlight it in the storage system view. 

2. Drag the drive to the other controller or right-click to select the Move To Other 
Controller option. 
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Verifying a Path 

Choose Verify a Path when you want SPM to determine the current state of a path. 

To verify a path: 

1. Click the path. 

2. Right-click to select the Verify Path option. 

SPM will generate a pop-up message when the verification is complete to indicate the 
result of the operation. No state change will occur as a result of this operation. 

NOTE: For more information on Secure Path, refer to the Secure Path documentation 
provided with the Secure Path CD. 

MSA1000 Controller Hot Replace 

In an HA/F200 cluster, an MSA1000 Controller can be replaced in the MSA1000 
storage system without powering down the storage system or taking the cluster 
offline. This procedure is called "hot replace." 

To replace the MSA 1000 Controller: 

NOTE: During this procedure, Auto-Failback and Path Verification can be either enabled or 
disabled. 

1 . Identify which controller needs to be replaced from the Secure Path Manager 
screen. 

2. Remove the MSA1000 Controller. It can be either the active or standby 
MSA 1000 Controller. 

3. Wait for Secure Path to acknowledge the failed path. 

4. Insert the replacement controller. Wait for the controller to display "Redundancy 
Active." 
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5. Initiate a rescan by pressing the F5 key or by selecting View - Refresh in the 
Secure Path Manager. 

6. Click on any failed MSA1000 Controllers in the right pane and choose Verify 
Path or Repair Path, if necessary. 

7. Make a note of the change in the serial number of the replaced MSA 1000 
Controller. 

NOTE: In order to swap in a previously removed controller into its original slot/location, the 
system might need to be rebooted. 
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Troubleshooting the Compaq ProLiant 
Clusters HA/F100 and HA/F200 for 

MSA1000 



This chapter addresses problems encountered while installing, configuring, testing, 
and operating the Compaq ProLiant Clusters HA/F100 and HA/F200 for MSA 1000. 
These problems are described in the following troubleshooting categories: 

• Installation 

• Node-to-Node 

• Shared Storage 

• Client-to-Cluster Connectivity 

• Cluster Groups and Cluster Resources 

• Compaq SAN works Secure Path for Windows Workgroup Edition 

• Compaq Storage Works FCA-2101 Host Bus Adapter 



Compaq ProLiant Clusters HA/F100 and HA/F200 for MSA1000 Administrator Guide 



7-1 



Troubleshooting the Compaq ProLiant Clusters HA/F100 and HA/F200 for MSA 1000 



Installation 

Table 7- 1 addresses problems encountered during installation. 

NOTE: MSCS describes Microsoft Cluster Server (Windows NT Server 4.0, Enterprise 
Edition) or Cluster Service (Windows 2000 Advanced Server) 



Table 7-1: Solving Installation Problems 



Problem 


Possible Cause 


Action 


The error message "RPC 
Server is Unavailable" is 
displayed. 


Name resolution issue 


1 . Ensure that the server is 
operational and that the 
Cluster Service and the 
RPC services are running. 

2. Check the name 
resolution of the cluster. 
It is possible that you are 
using an incorrect name 
or that the name is not 
being properly resolved by 
WINS or DNS. 


Cluster Administrator does not 
appear in the Start menu. 


MSCS might not have been 
successfully installed. 


1. Uninstall MSCS (or any 
pieces that might have 
installed incorrectly). 

2. Reinstall MSCS. 


Node Performance is sluggish 
and the node fails. 


Processor might be 
overloaded. 


Verify that the processor is not 
running at or near 100% 
utilization. 




Paging file is too small. MSCS 
might view the size deficiency 
as a node failure and cause a 
failover event. 


Check the size of the 
Windows Enterprise Edition or 
Windows 2000 Advanced 
Server paging file. 



continued 
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Table 7-1: Solving Installation Problems continued 



Problem 



Possible Cause 



Action 



MSCS installation will not 
complete on the first node. 



Insufficient space exists on 
nonshared drives for MSCS. 



Ensure that there is enough 
free disk space for MSCS 
installation files. 



Operating system is incorrect 
or deficient. 



Ensure that Windows 
Enterprise Edition and 
appropriate Service Pack or 
Windows 2000 Advanced 
Server is installed. 



Primary IP address is invalid. 



Verify that the addresses are 
valid. If DHCP is used to 
obtain noncluster IP 
addresses, run 

IPCONFIG.EXE to ensure that 
the network adapter cards 
have valid IP addresses. If the 
second IP address listed is 
0.0.0.0, your primary address 
is invalid. 



Cluster nodes need to be 
rebooted. 



Reboot cluster nodes after 
installing MSCS. 



The physical disk resource 
cannot be brought online. 



Hardware errors or 
communications transport 
problems exist. 



Use Event Viewer to verify that 
no hardware errors or 
transport problems exist. 
Check the event log for disk 
I/O error messages or 
indications of problems with 
the communications transport. 



You might not have waited 
long enough to view the most 
current data. Cluster 
Administrator takes a snapshot 
of the registry when it 
starts up. 



Wait a minute, then click 
Refresh (F5). 



continued 
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Table 7-1: Solving Installation Problems continued 



Problem 



Possible Cause 



Action 



Added logical drives are not 
recognized. 



Windows Enterprise Edition 
and Secure Path do not 
support hot-adding of logical 
drives. 



To add logical drives: 

1 . Add drives in Disk 
Administrator. 

2. Add drives to cluster. 



Secure Path seems to function 
properly but does not 
recognize component failover. 



Cluster might be improperly 
cabled. 



Refer to the Compaq ProLiant 
Cluster HA/F200 for MSA1000 
Configuration poster for proper 
cabling of the cluster. 



Node-to-Node 

Table 7-2 describes problems that may be encountered during node-to-node 
communication. 

Table 7-2: Solving Node-to-Node Problems 



Problem Possible Cause Action 



The resources failed over No connectivity between 1 . Run Control Panel service applet 
but the nodes do not see the nodes. on both nodes and verify that the 

each other. RPC service is running. 

2. Use a network analyzer (such as 
Network Monitor), or RPCPing 
(available on the Microsoft 
Exchange Server CD) to ensure 
that RPC connectivity exists on 
both nodes. 

Secure Path does not Cluster might be Refer to the Compaq ProLiant Cluster 

recognize node failover. improperly cabled. HA/F200 for MSA1000 Configuration 

poster for proper cabling of the cluster. 

continued 
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Table 7-2: Solving Node-to-Node Problems continued 
Problem Possible Cause Action 



The second node cannot Improper name resolution 1. Data in the WINS or DNS server 
join the cluster. exists. might be incorrect for the 

second node. 

2. Ensure that the proper cluster 
name, node name, or IP address 
is used. 



3. Verify that Cluster Name resource 
has started. 



Cluster Service is not 


Verify that Cluster Service is running 


running. 


on the first node before attempting to 




join the second node. 


No network connectivity 


Check network connectivity. 


exists. 




TCP/IP is not configured 


Verify TCP/IP configuration on both 


properly. 


nodes. 



No IP connectivity. Verify IP connectivity to the cluster 

address. If unable to ping the IP 
address of the cluster, run Cluster 
Administrator on the first node. Ensure 
that the cluster IP address resource is 
running. 

Primary IP address is 1. Check the cluster IP address 

invalid. resource properties. Ensure that 

the cluster has a valid IP address 
and subnet mask. Ensure IP 
address does not conflict with an 
existing IP address on the 
network. 

2. Verify that addresses are valid. If 
DHCP is used to obtain noncluster 
IP addresses, run IPCONFIG.EXE 
to ensure that the network adapter 
cards have valid IP addresses. If 
the second IP address listed is 
0.0.0.0, your primary address is 
invalid. 
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Shared Storage 

Table 7-3 addresses problems encountered using the MSA1000 storage system as a 
shared storage device. This section does not address MSA 1000 storage system 
problems specific to the storage system itself, nor does it address the storage system 
in a stand-alone server configuration. For those issues, see the documentation that 
came with your storage system. 

Table 7-3: Solving Shared Storage Problems 



Problem Possible Cause Action 



Drives in the MSA1000 You might not have waited Wait a minute, then click Refresh (F5). 
storage system are not long enough to view the 
recognized. most current data. Cluster 

Administrator takes a 

snapshot of the registry 

when it starts up. 

Cluster nodes need to be Reboot cluster nodes after installing 
rebooted. MSCS. Ensure that the drives are 

recognized. 

Host bus adapter driver is Ensure that the host bus adapter driver 
not installed. for Windows Enterprise Edition or 

Windows 2000 Advanced Server is 
installed and running on both servers. If it 
is not installed, refer to the documentation 
that came with your host bus adapter. 

Use Event Viewer to verify that no 
hardware errors or transport problems 
exist. Check the event log for disk I/O 
error messages or indications of problems 
with the communications transport. 

Compaq ProLiant Clusters Use Event Viewer to verify that no 
do not support the hardware errors or transport problems 

physical SCSI disks. exist. Check the event log for disk I/O 

error messages or indications of problems 
with the communications transport. 

continued 



Hardware errors or 
communications transport 
problems exist. 
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Table 7-3: Solving Shared Storage Problems continued 



Problem 



Possible Cause 



Action 



Drives in the MSA1000 
storage system are not 
recognized. 



Possible drive 
configuration problems 
exist. 



1 . Run the Compaq Array Configuration 
Utility (ACU) online. This utility can be 
run online if at least one logical drive 
is configured and recognized. 

2. Run the ACU XE. Shut down 
the servers and reboot with the 
Modular SAN Array 1000 Support 
Software CD. 

3. If all drives are not recognized by the 
ACU, verify that all GBIC-SW 
modules are properly seated. 

4. Verify that all Fibre Channel cables 
are properly connected to the 
GBIC-SW modules. For details on 
how to connect the GBIC-SW 
modules and fiber-optic cables, see 
the documentation that came with 
your storage system. 

5. When all drives are recognized and 
configured correctly, run Disk 
Administrator or Disk Management. 

6. Verify that all drive volumes appear in 
Disk Administrator or Disk 
Management and that each has a 
permanent drive letter assigned. 

7. Verify that the drive letter 
assignments on the second cluster 
node match the assignments on the 
first cluster node. 



Data on shared 
storage seems to be 
overwritten. 



MSCS might not be 
loaded and therefore 
cannot manage access to 
drive volumes in the 
shared storage. 



1 . Ensure that MSCS is loaded on both 
cluster nodes. 

2. Using the Control Panel Service 
applet on each node, verify that the 
cluster service is running. 



continued 
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Table 7-3: Solving Shared Storage Problems continued 



Problem 



Possible Cause 



Action 



The second node 
cannot connect to the 
shared drives. 



Physical connection from 
the second node to the 
Fibre Channel storage hub 
or Fabric switch does not 
exist. 



Verify that all Fibre Channel cables are 
properly connected to the GBIC-SW 
modules. For details on how to connect 
the GBIC-SW modules and the Fibre 
cables, refer to the documentation that 
came with your storage system. 



Shared MSA1000 drives 
are not assigned the same 
drive letters on both 
nodes. 



Run Windows Enterprise Edition Disk 
Administrator or Windows 2000 Advanced 
Server Disk Management on each node to 
verify that all shared drives are assigned 
identical, permanent drive letters. 



The second node was 
powered up before the 
Fibre Channel storage 
hub, Fabric switch, or the 
MSA1000 storage system. 



Shut down the second node, power it 
down, and then power it up. 



Secure Path shows the 
cluster in 

non-redundant mode. 



You are using ACU to 
expand capacity. 



The firmware will automatically put the 
MSA1000 storage system in non- 
redundant mode during capacity 
expansion. This is normal behavior. 



Interconnect link (ICL) has 
failed. 



Check the Windows Enterprise Edition or 
Windows 2000 Advanced Server Event 
Log to determine if the ICL has failed. 



Failover may have 
occurred. 



Check to see if a failover has occurred. It 
is normal behavior for a drive rebuild to 
restart if a failover has occurred. 



continued 
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Table 7-3: Solving Shared Storage Problems continued 



Problem 



Possible Cause 



Action 



Storage performance is 
marginal. 



Cache modules on the 
array controllers do not 
match. 



Replace the cache module so that cache 
levels match on both array controllers. 



Windows Enterprise 
Edition or Windows 
2000 Advanced Server 
Event Log states: The 
Host Bus Adapter in 
slot 1 has averaged 
more than 2 Elastic 
Store Errors over the 
five seconds. The loop 
has become unstable. 
The adapter has been 
taken offline. 



GBIC-SW laser has 
malfunctioned. 



1 . Refer to the documentation that came 
with your storage system for 
instructions on replacing a GBIC-SW. 

2. Manually fail back resources. 



Windows Enterprise 
Edition or Windows 
2000 Advanced Server 
Event Log states: The 
Host Bus Adapter in 
slot 1 has averaged 
more than 2 Soft PCI 
Errors over the last five 
seconds. The loop has 
become unstable. The 
adapter has been 
taken offline. 



GBIC-SW laser has 
malfunctioned. 



1 . Refer to the documentation that came 
with your storage system for 
instructions on replacing a GBIC-SW. 

2. Manually fail back resources. 



Windows Enterprise 
Edition or Windows 
2000 Advanced Server 
Event Log states: The 
Host Bus Adapter in 
slot 1 has averaged 
more than 2 Reset 
Login Primitives over 
the last five seconds. 
The loop has become 
unstable. The adapter 
has been taken offline. 



GBIC-SW laser has 
malfunctioned. 



1 . Refer to the documentation that came 
with your storage system for 
instructions on replacing a GBIC-SW. 

2. Manually fail back resources. 
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Client-to-Cluster Connectivity 

Table 7-4 addresses problems that might be encountered in cluster-to-LAN 
communication. 

NOTE: The cluster is assigned one or more Net BIOS names associated with an IP address. 
Network clients communicate with the cluster through TCP/IP. 



Table 7-4: Solving Client-to-Cluster Connectivity Problems 



Problem 



Possible Cause 



Action 



Clients do not 
see the cluster. 



TCP/IP is not configured 
properly. 



Verify that TCP/IP is configured on all network 
clients and on the cluster nodes. 



Clients might not be running 
the correct operating system. 



Verify that clients are running 

Windows NT 4.0 or Windows 2000 and the 

latest Service Pack. 



Clients do not 
see virtual 
servers. 



Virtual servers may not have 
their own IP addresses or 
Network Name resources. 



Use Cluster Administrator to verify that both 
resources are online. 



Client protocol problem exists. 



1 . Verify that the client is accessing the 
cluster using the correct cluster network 
name or IP address. 

2. Verify that the client has TCP/IP protocol 
correctly installed and configured. 



Resource name resolution 
problem may exist. 



Use NetBT cache (NBTSTAT.EXE on the 
Windows Enterprise Edition CD) to determine 
if the name had been previously resolved. 



Improper WINS configuration. 



Confirm proper WINS configuration at the 
client using the WINS administrator. If WINS 
is not used, the hosts or Imhosts file will 
contain entries for each virtual server's 
network name and IP address. 



IP address problem exists if 
the client is accessing the 
resource through a specific IP 
address. 



Ping the IP address of the cluster resource 
and cluster nodes from a DOS command 
prompt. 



continued 
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Table 7-4: Solving Client-to-Cluster Connectivity Problems continued 



Problem 



Possible Cause 



Action 



Clients cannot 
access 

resources on a 
cluster node. 



The corporate LAN Ethernet 
card has failed, but the private 
interconnect (cluster 
communication) continues to 
operate. The clients no longer 
have access to their primary 
cluster node and are failed 
over to the secondary cluster 
node. The applications in use 
by the clients remain on the 
primary cluster node because 
the private interconnect is still 
operating, precluding a failover 
of the applications. 



1. Manually fail over each of the 
applications from the primary server to 
the secondary server. Make sure 
automatic tailback is disabled, to prevent 
the application from failing back to the 
inaccessible primary server. 

2. Implement a redundant interconnect/LAN 
strategy. Install three PCI network cards 
per server. Set up one as a private 
interconnect configured for cluster 
communication only. Set up the other 
cards for client access (the LAN) and 

for cluster communication. Configure 
the LAN network cards using the 
TLAN Teaming and Configuration Utility 
(for ThunderLAN NICs) or the Network 
Teaming and Configuration Utility (for 
Intel NICs). Configured in this way, the 
private interconnect has two backups and 
the primary LAN network card has a 
backup as well. 



Clients cannot 
access cluster 
resources. 



WINS- or DNS-related 
problems exist. 



1 . Verify that the client machines are 
configured with WINS or DNS. 

2. Verify that the cluster nodes are 
configured with either WINS or DNS. 

3. If DNS is used for the cluster nodes, 
make sure a DNS address record for the 
cluster exists in the DNS database, 
particularly if the client is on a different 
subnet than the cluster. 

continued 
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Table 7-4: Solving Client-to-Cluster Connectivity Problems continued 
Problem Possible Cause Action 



Clients cannot Networking problem exists, 
access a group 
that has 
failed over. 



1. Ensure that a physical connection 
between the clustered nodes exists. 

2. Verify that the network cabling is not 
damaged or loose on the surviving node. 

3. Verify that MSCS is able to receive the 
heartbeat of the surviving node and has 
properly failed over the resources. 

4. Verify that the failed over group is a 
virtual server (that is, the group contains 
an IP address as a resource). 

5. If a Fibre Channel storage hub or Fabric 
switch joins the clients to the cluster, 
check the physical connection between 
the clients and the Fibre Channel storage 
hub or Fabric switch and between the 
Fibre Channel storage hub or Fabric 
switch and both cluster nodes. 
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Cluster Groups and Cluster Resources 

Microsoft Cluster Administrator solves many group and cluster resource problems. 
For troubleshooting tips on this topic, refer to the Microsoft clustering documentation 
that came with your operating system and Cluster Administrator's Online Help. 



Table 7-5: Solving Other Problems 



Problem 


Possible Cause 


Action 


An application starts 
but cannot be closed. 


Cluster Administrator must 
be used to take 
applications offline. 


Use Cluster Administrator to take the 
application offline. Do not close or exit an 
application from the application interface. 


A resource hangs 
when taken offline. 


Some resources take time 
to go offline. 


Wait several minutes, then check any 
dependencies that the resource may have. 
Verify that each can be taken offline. 


An IP address added 
to a cluster group fails. 


IP address configuration 
may be incorrect. 


1 . IP addresses must be different from 
every other IP address resource and 
from every other IP address on the 
network. 

2. The IP address must either be 
reserved by the network administrator 
or statically assigned outside of a 
DHCP scope. 


A resource fails over 
but does not fail back. 


Groups may be configured 
to fail back manually, or 
the automatic tailback may 


1 . If expecting an automatic tailback, 
verify that the group is not set for a 
manual tailback. 



be scheduled to occur 

during a specific time 2 - lf expecting an immediate automatic 

frame. tailback, verify that the group is not 

configured to fail back only during a 
user-defined timeframe. 
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Compaq SANworks Secure Path for Windows 
Workgroup Edition 

For detailed information on troubleshooting Secure Path, refer to the Compaq 
SANworks Secure Path Installation and Reference Manual and Release Notes 
included with the HA/F200 for MSA 1000 cluster kit. 

Compaq StorageWorks FCA-2101 Host Bus Adapter 

The FCA-2101 host bus adapter comes equipped with two LEDs that are used to 
display the state of the host bus adapter. Table 7-6 lists the states in which you may 
find the host bus adapter. 



Table 7-6: FCA-2101 Host Bus Adapter States 



Green LED 


Yellow LED 


State 


OFF 


OFF 


Wake-up failure (dead board) 


OFF 


ON 


POST failure (dead board) 


OFF 


Slow blink (1 Hz) 


Wake-up failure (dead board) 


OFF 


Fast blink (4 Hz) 


Failure in POST (dead board) 


OFF 


Flashing (irregular) 


POST processing in progress 


ON 


OFF 


Failure while functioning 


ON 


ON 


Failure while functioning 


ON 


Slow blink (1 Hz) 


Normal for the 1-Gb link rate 


ON 


Fast blink (4 Hz) 


Normal for the 2-Gb link rate 



continued 
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Table 7-6: FCA-2101 Host Bus Adapter States continued 



Green LED 


Yellow LED 


State 


Slow blink M Hz} 

VJIuVV k_/ I I 1 1 l\ 1 1 1 1 L— J 


OFF 


Normal when the link k down nr not vet 

1 N \J 1 1 1 IC4-I VV 1 1 \j II L 1 1 \j 1 1 1 1 1\ 1 O \A w V V 1 1 \J 1 1 1 w L y w I 

started 


Slow blink (1 Hz) 


ON 


Not defined 


Slow blink (1 Hz) 


Slow blink (1 Hz) 


Offline for download 


Slow blink (1 Hz) 


Fast blink (4 Hz) 


Restricted offline mode (waiting for restart) 


Slow blink (1 Hz) 


Flashing (irregular) 


Restricted offline mode, test active 


Fast blink (4 Hz) 


OFF 


Debug monitor in restricted mode 


Fast blink (4 Hz) 


ON 


Not defined 


Fast blink (4 Hz) 


Slow blink (1 Hz) 


Debug monitor in test fixture mode 


Fast blink (4 Hz) 


Fast blink (4 Hz) 


Debug monitor in remote debug mode 


Fast blink (4 Hz) 


Flashing (irregular) 


Debug monitor output active 
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Software and Firmware Versions 



The following table lists supported software and firmware versions for the Compaq 
ProLiant Clusters HA/F100 and HA/F200 for MSA1000. 

IMPORTANT: For information about software and firmware updates recommended or 
required for your Compaq ProLiant Cluster, refer to the Compaq High Availability website 
(www.compaq.com/highavailability). 



Table A-1 : Supported Software/Firmware Versions 



Compaq SmartStart for Servers CD 


5.2 or later 


Modular SAN Array 1000 Support Software CD 


5.3 or later 


Compaq Support Paq for Microsoft Windows NT 4.0 and Microsoft 


5.04A or later 


Windows 2000 




Compaq StorageWorks Modular SAN Array 1000 Controller firmware 


1.10 or later 


Compaq SANworks Secure Path V3.1B for Windows Workgroup Edition 


3.1 B or later 


Microsoft Windows NT Server 4.0 Service Pack 


6a or later 


Microsoft Windows 2000 Service Pack 


2 or later 


Array Configuration Utility 


2.80 or later 


Array Configuration Utility XE 


1 .30 or later 


Compaq Insight Manager 


5.3 or later (optional) 


Compaq Insight Manager XE 


2.0 or later (optional) 


Compaq Insight Manager 7 


5.3 or later (optional) 


Compaq Intelligent Cluster Administrator 


2.1c or later (optional) 
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Active/active 

A state in which each host bus adapter is processing data when the cluster is operating 
normally. 

Active/standby 

A state in which one host bus adapter is processing data and the other is in a booted, but 
inactive, state when the cluster is operating normally. The standby node does not execute its 
own clustered applications and has no cluster groups to fail over to the active node. 

Arbitrated loop 

A connection method in which two or more ports can be interconnected, but only two ports at 
a time can communicate. 

Array controller 

A hardware device that facilitates communications between a host and one or more devices 
organized on an array. Also called MSA 1000 Controller. 

Availability 

A measure of how well a computer system can continuously deliver services to its clients. 
Availability is typically expressed as a percentage, with 100 percent being the best possible 
rating. 

Blink LEDs 

The ability to light the drive tray LEDs on a particular MSA1000 storage system. 
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Business-critical 

Pertaining to applications that are key to business success but not significant enough to justify 
the high cost of a traditional clustering solution. Their failure causes lost revenue, decreased 
productivity, and customer dissatisfaction. 

Cluster 

A group of systems that work collectively as a single system to provide fast, uninterrupted 
computing service. Clustering is a way to increase availability, processing capacity, and I/O 
bandwidth. 

Cluster group 

A collection of interdependent resources that logically represents a clustered client/server 
function. This is a user-definable entity used by the Microsoft cluster software (Microsoft 
Cluster Server for Windows NT or Cluster Service for Windows 2000). 

Cluster software 

See Microsoft clustering software 

Compaq StorageWorks MSA1000 Controller 

A hardware device that facilitates communications between a host and one or more devices 
organized on an MSA 1000 storage system. 

Compaq StorageWorks FCA-2101 

A device that provides an interface between a host system (server) and storage system or 
other devices connected on a Fibre Channel arbitrated loop or fabric. 

Compaq StorageWorks Modular SAN Array 1000 

A storage device including disk drives and one or more resident array controllers. 
Conflict 

A misunderstood command sent by software that causes processing to halt. 
Dedicated interconnect 

A type of interconnect that is used solely for intracluster (node-to-node) communication. 
Communication to and from network clients does not occur over this type of interconnect. It 
is also known as a private interconnect. 
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Dynamic IP address assignment 

An automatically assigned address that comes from an unused IP address pool. When the 
client/server session ends, the IP address is returned to the pool. The server can use fewer IP 
addresses to communicate with clients. Clients do not need to provide a hard-coded address 
to the server. 

Ethernet 

A standard network protocol that primarily operates on a physical level, using network 
interface cards and cabling to transmit data between computers. Transfer rates are normally 
1,000 or 10,000Mb/s. 

Failback 

The process of moving the operation of a previously failed cluster group from the secondary 
cluster node back to the primary node. 

Failover 

The process of moving the operation of a cluster group from the primary node to the 
secondary node in the same cluster. 

Fault tolerance 

A design concept that allows continued operation of a computer entity when a fault (or 
failure) is encountered. Tolerance is achieved primarily by designing redundant elements into 
the entity. 

Fibre Channel Array 

See Compaq Storage Works Modular SAN Array 1000 
Fibre Channel 

An IEEE standard for providing high-speed data transfer among workstations, mainframes, 
supercomputers, desktop computers, storage devices, and display devices. 

Fibre Channel Host Adapter 

See Compaq Storage Works FCA-2101 and host bus adapter 
Fibre Channel Host Controller 

See Compaq Storage Works FCA-2101 and host bus adapter 
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Fibre Channel array controller 

See Compaq Storage Works MSA 1000 Controller 

Heartbeat 

A signal transmitted between cluster nodes to indicate each node's operating state. 
High availability 

A term used to identify a computer system that can continuously deliver services to its clients 
99.9 percent of the time (no more than 8.5 hours of downtime per year). 

Host 

The central or controlling computer in a system of computers connected by communications 
links. 

Host bus adapter 

A card used to connect a peripheral to the host server. Also called host adapter. 
Host bus adapter failover 

The process of moving the operation of all I/O from one host bus adapter to another host bus 
adapter. This can be accomplished manually using Secure Path or automatically on failure of 
one of the adapters. 

Hot-pluggable 

Pertaining to computer components that can be removed and replaced without powering 
down the system. 

Hot spare 

A computer component that is powered on, not actively processing data, and awaiting a 
request to take over operation of a failed component. 

Interconnect 

A physical connection between cluster nodes that transmits intracluster communication. 
Intracluster communication 

The type of communication in which the cluster interconnect is a data path over which nodes 
of a cluster communicate. At a minimum, the interconnect consists of two network adapters 
(one in each server) and a cable connecting the adapters. 
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IP address 

Internet Protocol address. A number that uniquely identifies a host (server) so that computer 
entities can locate and communicate with each other through the transfer of packets. IP 
addresses can be statically or dynamically assigned. 

Load balancing 

To balance the amount of work (processing) undertaken by each node in a cluster. 
Logical disks 

The available logical disks that Secure Path can access displayed in a tree hierarchy format. 
MSCS 

See Microsoft clustering software 
Microsoft clustering software 

The software needed for clustering servers. In Windows NT the software is Microsoft Cluster 
Server. In Windows 2000 the software is the Cluster Service component. 

Mission-critical 

Pertaining to applications essential to business operations. 
MSA1000 

See Compaq Storage Works Modular SAN Array 1000 
Network interface controller 

A board that allows a computer to be connected with a network and that works with the 
network operating system to control the flow of information over the network. 

NIC 

See Network interface controller 
Node 

An individual server in a cluster. 
NTFS 

Microsoft New Technology File System. A file organization system by which data is stored 
and accessed in a Windows NT or Windows 2000 operating system. 
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Paging file 

A Windows NT or Windows 2000 paging file for virtual memory called PAGEFILE.SYS. 
The paging file is not necessarily a single file; it can be a group of files stored in various 
locations, across hard disks and partitions. 

POST 

See Power-On Self-Test 
Power-On Self-Test 

A set of operations executed every time a system is turned on that verifies components are 
present and operating. 

Preferred node 

The principal server an application is configured to operate from. 
Private interconnect 

A type of interconnect that is used solely for intracluster (node-to-node) communication. 
Communication to and from network clients does not occur over this type of interconnect. It 
is also known as a dedicated interconnect. 

Proprietary clustering system 

Traditionally a large, highly scalable UNIX and mainframe system that is costly and 
challenging to set up and maintain. Used only in mission-critical applications that can afford 
no downtime. 

Public interconnect 

A type of interconnect that takes care of communication between the cluster nodes and shares 
the data path with communication between the cluster and its network clients. 

Quorum disk 

A device managed by the Microsoft cluster software that provides a means for persistent 
storage of the cluster configuration information required for failover and fallback events as 
well as for arbitrating ownership of cluster resources. 
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RAID 

See Redundant Array of Inexpensive Disks 
Redundant Array of Inexpensive Disks 

A method of using hard disk drives in an array to provide data redundancy to increase system 
reliability and performance. RAID is classified at the following levels: 

RAID 0 — Data striping. This RAID level stripes data across all drives of the array but does 
not incorporate a method to create redundant data. RAID 0 does not provide fault tolerance. 

RAID 1 — Drive mirroring. This level creates fault tolerance by storing two sets of duplicate 
data on a pair of disk drives. 

RAID 1+0 — Data striping with mirroring. This level stripes data across the drives of the 
array but duplicates the data through mirroring to create a level of fault tolerance. 

RAID 4 — Data guarding. This level involves the use of a single, designated drive containing 
parity data. If a drive fails, the controller uses the data on the parity drive and the remaining 
drives to reconstruct data from the failed drive. 

RAID 5 — Distributed data guarding. This level stores parity data across all the drives in the 
array. Spreading the parity across all the drives allows more simultaneous read operations and 
higher performance than data guarding (RAID 4). 

RAID ADG — Advanced Data Guarding is the fault tolerance level in which data and data 
protection are distributed across all physical disks with protection for failure of up to two 
physical drives. RAID ADG logical drives can sustain two drives failing without data loss. 
Spreading the parity across all the drives allows more simultaneous read operations and 
higher performance than data guarding (RAID 4). RAID ADG requires a minimum of four 
physical disks in the array on which the logical drive is to be created. 

Redundancy 

The provision of multiple, interchangeable components to perform a single function in order 
to cope with failures and errors. A RAID set is considered to be redundant when user data is 
recorded directly to one member and all of the other members include associated parity 
information. 
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Reliability 

The continuous integrity of a system (server, storage, network, or cluster). 
Rescan 

The ability to check for new or lost logical disks or an array controller on the loop. 
Resource 

A software or hardware entity on which a client/server application or service is dependent. As 
it pertains to the Microsoft cluster software, a cluster resource must have the ability to be 
managed by the cluster and must reside on one of the cluster nodes. A resource can be a 
member of only one group. 

Scalability 

The ability of a system (server, storage, network, or cluster) to grow or expand in 
performance or capacity. 

SCSI 

Small Computer Systems Interface. A standard parallel interface for rapid data transmission. 
Service 

A data set or operation set exported by application servers to their clients. 
Shared resource 

A type of cluster organization in which some resources are accessible to all systems in the 
cluster. 

Shared storage clustering 

The cluster architecture in which clustered servers share access to a common set of hard 
drives. Microsoft Cluster Software (MSCS) is based on shared storage clustering and requires 
all clustered (shared) data to be stored in an external storage system. 
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Static IP address assignment 

A unique, nonchanging number assigned to computer entities that allows them to reliably 
communicate with each other. 

System 

A complete computer system capable of operating independently. A system can be either a 
single processor system or an SMP system. Example: a server. 

UPS 

Uninterruptible Power Supply. 
Virtual server 

A cluster group that has an IP address (and optional network name) as one of its resources. 
Network clients can connect directly to a virtual server, which appears to clients as an 
individual server. 
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